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THE EOUATION as 7Gyy) 


Basie. CONCEPTS 


|. Definitions. Examples 


By an ordinary differential equation of order n we mean a relation of the 


form 
FC, yoy os YO) = 0, 


involving a function F of x + 1 variables, an independent variable x, an 
unknown function y = y(x), and the first # derivatives p’,..., y" of this 
function with respect to x. A function y = (x) with derivatives up to order 7 
is called a solution of the differential equation if the substitution 


y= 90), ¥ = 9)... ph = MQ) 


reduces the differential equation to an identity. Unless the converse is explic- 
itly stated, we shall always assume that the quantities x,y, y',...,y' take 
only (finite) real values and all functions are single-valued. The word “ordi- 
nary’ means that the unknown function depends on only one independent 
variable x. We might also study “partial” differential equations, where the 
unknown function depends on several variables. However, this book is 
concerned exclusively with ordinary differential equations, except in the 
Supplement. 

Ordinary differential equations play a key role in science and technology, 
as illustrated by the following two examples: 

Example 1. Describe the motion of a particle moving along the x-axis 
whose velocity at time ¢ is given by the continuous function f(t) and whose 
position at time ¢ = fy IS Xp. 
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Here we must find the solution of the differential equation 


dx : 
—= f(t 
dt 0 


which takes the value Xy at time fo. Obviously this solution ts just 
3 t re e 2 
x(t) = Xo +] L(G) ds. 
est 

Example 2. Given Ry grams of radium at time t), describe the subsequent 
process of radioactive decay. 

Let R = R(r) be the amount of radium present at time f > fp. Since the 
rate of radioactive decay is always proportional to the amount of radium 
present, with some constant of proportionality ¢ > 0, we want the solution 
of the differential equation 


which takes the value R, at time 7p. It is casily verified by direct substitution 
that R= Rens! 
is the required solution. 


In both examples, the differential equation itself is not enough to uniquely 
determine the unknown function, 1.e., in each case there are many functions 
satisfying the same differential equation. However, in each case there is a 
unique solution which takes a given value for a given value of the independent 
variable. Thinking of the independent variable as time, we refer to such a 
supplementary condition imposed on the solution as an initial condition. 

Stated concisely, the basic problem of the theory of differential equations 
is to find all solutions of a given differential equation and study their properties. 
The process of finding solutions of a differential equation ts often called 
integration of the equation. 


2. Geometric Interpretation. Direction Fields and 
Integral Curves 


Consider the differential equation 
y =f, ¥), (1.1) 


where the function f(x, y) is defined on some domain of the xy-plane. 


* By a domain is meant a nonempty point set G such that 
1, Every point of G is an interior point, ic., has a neighborhood consisting entirely of 
points of G; 
2. G is connected, i.c., any two points of G can be joined by a polygonal line lying 
entirely in G. (A polyvonal line is a curve obtained by joining a finite number of line 
segments end to end.) 
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Suppose that through every point (x, y) of G we draw a short line segment with 
slope f(x, y).° This gives a set of directions in G, called the direction field of 
(1.1). Then the problem of solving the differential equation (1.1) has the 
following geometric Interpretation: Fiid all graphs y = (x),a<x<binG 
whose tangents have directions belonging to the direction field of (1.1).? 

From a geometric point of view, the problem as just stated has two 
unnatural features: 


1, By requiring that the slope of the direction field at the point (x, y) of G 
be given by f(x, y), we exclude directions parallel to the y-axis; 


2. By considering only graphs with equations of the form y = (x), we 
exclude curves which are intersected more than once by perpendiculars 
to the x-axis. 


Therefore we now restate the problem in a somewhat more general way. 
First of all, we allow the direction field to be parallel to the y-axis [equiva- 
lently, we allow f(x, y) to become infinite]. Thus, besides the differential 
equation (1.1), we consider the differential equation 


ix : 
Ty TA y), (1.1’) 
where 
: 1 
I (x, y) oer ao oan 
f(x, ¥) 


At points where f and /, are both defined, we can use either (1.1) or (1.1), 
but we use (I.1’) at points where f becomes infinite. (Note that at least one 
of the functions fand f, is always defined, since /, = O01 f= o, while f= 0 
if f; = 0.) In the second place, we allow general curves in parametric form 
instead of graphs. This leads to the following generalization of the problem 
posed above: Find all curves x = (1), vy = ul), «<< 2 whose tangents 
have directions specified by the pair of equations (1.1) and (1.1"). Such curves 
will be called integral curves of the equations (1.1) and (1.1’),* or of the direc- 
tion field specified by (1.1) and (1.1’). Thus the graph of every solution of 
(1.1) is an integral curve of (1.1), (1.1), but an integral curve of (1.1), CV) 
need not be the graph of a solution of (1.1). 


* The two directions of a line segment will not be distinguished. 

7 A curve is said to be a prapé if it can be written in the form y = o(x), a <x <4, 
orx = U(y), ¢ < » << d. The graph is said to be smooth ifo or v is continuously diffteren- 
trable, 

* We shall often use the singular, referring to equation (1.1), (1.1). 
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Remark 1, From now on, by a curve we shall always mean a smooth curve 
with equations of the form xy = 4(), y = wd), a << B.° The restriction to 
smooth curves is equivalent to assuming that our solutions are continuously 
differentiable, and not just differentiable. This class of solutions is large 
enough for all our subsequent needs. 


Remark 2. Sometimes a direction field is specified on some or all of the 
boundary of the domain G, as well as in G itself.® Then integral curves can 
pass through boundary points of G, as well as interior points. 


Remark 3. Henceforth, when dealing with an equation of the form 


—o 


ns TOS pi (1.2) 
dx N(x, y) 


we shall not bother to explicitly write down the equation 


dx _ NG, y) 


_ = f(x, ¥): (1.2') 
dy M(x, y) ( 


However, we shall often write (1.2) in the form 
M dx — Ndy = 0, (1.3) 


which emphasizes the symmetry in x and y. Equation (1.3) specifies a direc- 
tion field at every point where both factors are defined and at least one is 
nonzero. 


Example J. The equation 
dy _y (1.4) 
ax 2% 


defines a direction field everywhere except at the origin. This direction field 1s 


*A curve x = 4(1), y = u(r), «<< 2 Is said to be smooth if A(t) and w(t) are both 
continuously diiterentiable and if 2./%(2) + p’*(r) = 0. (The values « = —o and 8 = +00 
are not excluded.) Every point of a smooth curve x = 2(¢), y = u(t), a <1 < & belongs to 
an arc which is a smooth graph. In fact, given any ft (& < fo < {), at least one of the quan- 
tities 4’(fo), 2(f,) 18 NOnzero, For example, suppose 2’(fo) 0. Then since 7’(r) is continuous, 
it cannot change sign in some interval 4, — ¢ <£ < t, + ¢, and hence we can uniquely 
solve the equation x = 4(1) for ¢ (why?), obtaining a function r = y(x), say. Substituting 
into the equation » = u(r), we find that v == ulyCx)] = (x), Ato — 2) <x < Alto + ©). 
But this is the equation of a (smooth) graph. 

* By a boundary point of a domain G is meant a point which is a limit point of G but 
does not belong to G. The set of all boundary points of G is called the boundary of G. A 
domain together with its boundary is cailed a closed domain. The closed domain consist- 
ing of G and its boundary is called the closure of G, denoted by G. 
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shown schematically in Figure |. Every 
direction in the field passes through the 
origin. Clearly, the function 


yprkx (1.5) 
is a solution of equation (1.4) for ar- 
bitrary finite A. However, the set of all 
integral curves of (1.4) is given by the 
formula 


ax -} by = 0, (1.6) 


FiGure ] 


where a and & are arbitrary constants, 
not both zero. Thus the y-axis is an integral curve of (1.4), but not the graph 
of a solution of (1.4). 

Since (1.4) does not define a direction field at the origin, we cannot call 
(1.5) and (1.6) integral curves at the origin (although they are integral curves 
everywhere else). Therefore it would be more accurate to say that the integral 
curves are half-lines (rays) emanating from the origin (with the origin itself 
excluded), rather than lines passing through the origin. 


y Example 2, The equation 
ee a8 d 
forts ~s \ y x - 
ae ge Oe ee ee a eee (1.7) 
pha foe NS) dx \ 
ee Ga cee . : ioe 
Ch Sole ey defines a direction field everywhere except at the origin. 
ie 3 re ; , : ; ' ? 
ey ted rhis direction field is shown schematically in Figure 2. 
The directions specified at the point (x, +) by the two 
FIGURE 2 equations (1.4) and (1.7) are orthogonal. Clearly, every 


circle with its center at the origin is an integral curve 
of equation (1.7), On the other hand, the solutions of (1.4) are the functions 


po + /R2 — Ee —R<x<R, 
ype-VR—-M,  -R<X<R. 
We conclude this section by introducing some more terminology: 
1. For brevity, we shall often say “the solution passes through the point 


St 
(Xp. 9)" instead of “the graph of the solution passes through the point 


(Xo, ¥9) - 

The function ¢(x, C,,...,C,) involving parameters C,....,C, 1s 
called the general solution of equation (1.1) in a domain G if every 
solution of (1.1) with a graph lying in G is given by 9 for a suitable 
ChoicecoMCy. a Cy. 


bho 
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3. The equation M(x, y) = 0 of an integral curve of equation (1.1), (1.1’) 
is called an integral of (1.1), (1.1/). 
4. An equation 
a 
D(x, y, Cy,...5 Cy) = 0 (1.8) 
involving parameters C;,...,. C,, is called a complere integral of equa- 
tion (1.1), (1.1’) in a domain G if every integral curve of (1.1), (1.1) 
lying in G is given by (1.8) for a suitable choice of Cy,..., C). 

Thus (1.5) is the general solution of equation (1.4) in the whole xy-plane 
minus the y-axis, while (1.6) is the complete integral of (1.4) in the whole 
xy-plane minus the origin. In Example 2, the funcuon 

y= +f R? — x2 
is the general solution of (1.7) in the half-plane y > 0, while the equation 
e+ y= R (1.9) 


is the complete integral of (1.7) in the whole xy-plane minus the origin. It 
will be proved in Sec. 5 that (1.4) has no integral curves other than (1.6), 
while (1.7) has no integral curves other than (1.9). 


Problem I, What domain in the plane has no boundary? 


Problem 2. Draw the integral curves of the following equations: 


d y xy d y lx + ¥| ad y x + |x| ; 

dx |xy}’ dx ox+y’ dx yr bl? 

dy {0 for y # x, dy l for yp #X, 
Gd) = ) 


dx ] for y= 


ev 
- 
a 
ed 


OQ for p=x. 
Find the domain in which each equation defines a direction field. 


Problem 3, Given a smooth curve x = <(f), y = (1), a <1 < b, suppose 
a<a’ <b’ <b. Prove that 


a) Theintervala’ < ¢ < b’is the union ofa finite number of closed intervals 
on each of which the curve is a graph of the form y = 9(x) or x = Y(y), 
with continuously differentiable © and J; 


b) There is a constant « > 0 such that the arc of the curve corresponding 
to the interval ¢’ <7 <7’ +e has no self-intersections for every ¢’ 
(a ct’ <b); 


c) If M(t), tf) is the length of the are of the curve with end points 4, and fy, 
then the ratio I(t), f2)/(f, — f,) is bounded and exceeds a positive constant. 

Problem 4, Are the following requirements independent: 

a) The direction field contains no directions parallel to the y-axis; 

b) Every integral curve is the graph of a function of x? 
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Problem 5, Find the equation of the locus of all points where the solutions 
of (1.1) have maxima and minima, How about inflection points, in the case 
where f(x, ¥) is differentiable? 


Problem 6, Verify that substituting curves or even curves with no self- 
intersections for polygonal lines in the definition of a domain given in footnote 
I, p. 4 results in an equivalent definition. 


Problem 7. Curves are often specified by equations of the form f(x, y) = 0. 
Suppose the function fis defined and continuously differentiable in the whole 
plane, where 


SCs, y) + 30s, y) + fs ¥) #0, (1.10) 


and suppose the set f(x, y) = 0 is nonempty. Verify that the set consists of a 
finite or countable number of disjoint curves, where every finite part of the plane 
can contain only a finite number of curves. Moreover, show that every curve is 
either closed, or else has both end points at infinity with no self-intersections. 
What would the set f(x, ) = 0 be like if the requirement (1.10) were dropped? 
How are these assertions changed if the function fis defined on an arbitrary 
domain G? 


2 


SOME SIMPLE 
DIFFERENTIAL EQUATIONS 


3. The Equation y’ = f(x) 


We begin by studying the particularly simple differential equation 


dy 
rd Us (2.1) 
dx 
distinguishing two cases: 
Case 1. The function f(x) is continuous on (a, 6).1 Then one of the solutions 
of (2.1) is obviously 
a 
v(x) =|" f(E) dé, (2.2) 
"0 
where x, and x belong to the interval (a, 6), and all other solutions differ 


from (2.2) by only an additive constant, 1.e., the general solution of (2.1) is 
given by 


Z bani - Fy 
y(x) =|" E) dé +. 
J %y 
Thus subjecting any given integral curve to shifts parallel to the y-axis 
generates all the other integral curves. Suppose an integral curve is required 


to pass through a given point (Xp, Yo) of the strip a <x < 8. Then this 
uniquely determines the constant C: 


C = Va- 


* We use parentheses to denote open intervals and brackets to denote closed intervals. 
Thus (a, 5) is the interval a < x < 6 and [a, &] the interval @ < x <b. 


[0 
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In other words, one and only one integral curve 


y(x) = yo +] $@) db 


passes through each point (Xp, 9) of the strip a <x < 6. 


Case 2. The function f(x) is continuous on (a,b) except at the point ¢ 
(a<c <b) where f{(-<) becomes infinite as x —» c. Then for x = c we determine 
the direction field by using the equation 


Se Cr) 
In this case, the direction field becomes steeper and steeper as the line x = c 
is approached. However, the situation is the same as in Case | in the open 


Strips a@<x<e andc<x <b. Thus if the point (xg, }'9) belongs to the 
strip a <x <.c, there is a unique integral curve 


"z bon = ~; 
Y= Yor}, [8 a (2.3) 
Zo 
lying in the strip and passing through (xo, ¥9). Suppose the integral 
i od " 2 
|, f@) a, a<x<e (2.4) 


converges as x > c — 0. Then as x ec — 0 the curve (2.3) approaches a 

definite point on the line x = c, as shown in Figure 3(a). If (2.4) diverges, the 

curve approaches the line x = ¢ asymptotically as x > c — 0. In the same 

way, we can study the behavior of the integral curves in the stripe <x < 8. 
Let f(x) > -+ oo as x > c + 0, and suppose the integral 


[* He) dé, Cc <= Xo < D (2.4’) 
~ Xy 


converges as x» c + Oifand only if the integral (2.4) converges as x > c — 0. 


FIGURE 3 
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Then we get the integral curves shown in Figure 3(a) if the integrals 
converge, and those shown in Figure 4 if the integrals diverge. If the integrals 
” converge, there are infinitely many integral curves 
of equation (2.1), (2.1') passing through any given 
point A = (NXg. Vo) in the strip a<x <b, For 
example, suppose a <Xy <c. Then any curve 
of the form ABCD [see Figure 3(a)] is an integral 
curve. 

Another possibility is that the integrals (2.4) 
and (2.4') both converge as x —» ¢ -: 0, but 


r-g X=C x= 
FiGure 4 


f{Q)--O as x—>c—), 
‘ ; ; ‘ eo 

* i . ” . 

{[Q)—> +0 as x—>c+0 


Then the integral curves behave as shown in Figure 3(b). There are infinitely 
many integral curves Ady, ABB,, ACC, ... passing through a given point A 
of the line x = c¢, but there is only one integral curve (say B,B8A) passing 
through a given point (say B,) of the strips @ <x <cande <x < &, since 
curves of the form 8,B8,, with a corner at the point &, are not regarded as 
integral curves." 

If the integrals (2.4) and (2.4’) diverge, there is a unique integral curve 
passing through every point of the strip a <x < b. It is recommended that 
the reader analyze the case where only one of the integrals (2.4), (2.4’) 
converges. 


Problem 1. Investigate the integral curves of the equation )” = f(x) in the 
Strip @ <x <6 in the case where f(x) = I/2(x), 9(c) = 0 and 9(c) exists 
(a <c <4), assuming that ¢(x) is continuous for a <x <b and g(x) #0 
rae. 


Problem 2, Examine the behavior of the integral curves of the following 
equations: 


| | 
ayeas=; by = FSS: 0) 7’ = es; 
Vsinx . Wsin® x / } 


I 


] 
ae Ge eee ¢ Pe elastin 
| ele? e) } en sin = f) 3 x“ sins . 


3 


(in the last case, allow @ to take various values.) In particular, examine the 
integral curves as x —- 0, 


Problem 3. Can the equation y’ = f(x) have a solution on the entire real 
axis if the function f(x) is discontinuous? 


* Recall from Remark 1, p. 6 that integral curves are assumed to be smooth. 
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4. The Equation y’ = f(y) 
The only difference between the differential equation 


dy 

dx 
and the equation (2.1) is that the roles of x and y are reversed. If f(y) is 
continuous and nonzcro for a < y < b, we can write (2.5) in the form 


fy) (2.5) 


dx | 


eee ae 
——« 
* 


dy (y) 
Therefore there is a untque integral curve 


X= Ng + [" UB 
“wo f (4) 
passing through every point (Xp, ¥)) of the strip a < y < 6, and subjecting 
any given integral curve to shifts parallel to the x-axis generates all the other 
integral curves. 
Next suppose f(y) is continuous on (a, 6) except at the point y=c 
(a <c <b) where f(y) —- 0 as y > c. Then there are three cases: 


Led 


gee (2.6) 
“vo f(4) 
diverges as y > c + 0, there is a unique integral curve passing through 
each point of the strip a < y < 6, and the line y = c (itself an integral 
curve) is the asymptote of all the integral curves. 

. If (2.6) converges as y > c + 0 and does not change sign as y passes 
through y = c, there are infinitely many integral curves passing through 
each point of the stripa << y <b. 

3. 1f (2.6) converges as y — c -+ 0 and changes sign as } passes through 
y = c, there are infinitely many integral curves passing through each 
point of the line y = c, but one and only one passing through each 
point of the stripsa <y<cande<p< 0d. 


ho 


These results all follow from Sec. 3, and the corresponding figures can be 
obtained by interchanging the x and y-axes in Figures 3 and 4. 


Problem 1, Investigate the integral curves of the equation y’ = f(y) in the 
case where f“(c) exists. 


Problem 2. Examine the behavior of the integral curves of the following 


equations: 
dy 


Y iy; by 2 = sings oo = tan 
Vom =a [pl*: y)—- = sinv pom = ld. 
_ ie dx ; dx B i 
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Problem 3. Let f(c) = 0 and suppose that arbitrarily close to y = ¢, both 
for y <ecand y >, there are values of y such that f()) <0 and values of y 
such that f(y) > 0. Prove that y = c Is the unique solution of )’ = /(y) passing 
through any point ¥ = xX, y = c¢. 


Problem 4. Prove that every sclution of the equation »” = f(y) is monotonic. 


Problem 3, Let f(y) be continuous for a < y < 8, and suppose there is a 
solution y = ¢(x) of the equation +’ = f(y) such that e(vy) > ¢ as x > +0 
(a <ec <b). Prove that » = cis also a solution. 


Problem 6. Let f(y) be continuous and aa for yy = » < oo, Find neces- 
sary and sufficient conditions for the solutions of the equation y’ = f(y) to have 
asymptotes. In particular, consider the case 


PY) 
10) = GG): 


where P and Q are polynomials. 
Problem 7.3 Give an example of an equation y’ = f(y) with a continuous 
right-hand side whose solutions include two with the following properties: 
1. They are defined and increasing for all x; 
2. Their graphs intersect in a single point. 
Hint. Let the set of zeros of the function f(y) be a perfect nowhere dense set. 
Problem 8. Give an example of two equations 3” = f{(y) > 0 and y’ = 


fly) > 0 with continuous right- -hand sides such that unique solutions exist for 
each equation, but not necessarily for the equation 


y’ = max Lio, fo)}- 


Invent variants of this problem. 


5. Equations with Separated Variables 


By a differential equation with separated variables we mean an equation 

of the form 
dy 
dx 


THEOREM. Suppose fi(x) is continuous for a<x <6 and f(y) is 
continuous and nonzero for ¢ < y < d. Then there exists a unique integral 
curve of equation (2.7) passing through each point (Xp, yo) Of the rectangle 
Cite De ye 


(x) fa(¥). (2.7) 


* For students with a background in point set theory, 
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Proof. First we prove the uniqueness. Suppose (x) satisfies (2.7) 
and the condition (xy) = yy. Then we have the identity 


d(x) 
= filx)falo(x)], 
dx 
which can be written as 
d(x) 
Se 25 
flo) 7 a) 


since fa(v) ~# 0. Integrating both sides of (2.8) between x, and x, we 
obiain 


oa) dolE rz e 
eer 
'Hro=vo f,lo(s)] <0 


Therefore 
Fp[e(x)] — Felyo) = File) — Fi). (2.9) 


where F,(y) is any primitive of I//.(v) and F(x) any primitive of f\(). 
But since f(y) is strictly monotonic [since Fy(y) = 1/f.(5:) = 0], we can 
solve (2.9) uniquely for o(x):? 


o(x) = Fr'[Fa(¥o) + Fi(x) — Fi(%o)]. (2.10) 


Thus, assuming that (2.7) has a solution equal to yy for x = Xp, we find 
that this solution must have the form (2.10) and hence must be unique, 
since all the functions in the right-hand side of (2.10) are completely 
determined by the differential equation and the initial condition. 

To prove the existence of a solution of (2.7) equal to yo for x = X49, 
we need only note that the function o(x) defined by (2.10) actually 
satisfies the differential equation (in a neighborhood of x») and the 
initial condition. In fact, differentiating (2.9) with respect to x, we obtain 

PACT 00) = FO) 
dg(x) 
or 


l 
oe i) = iC), 
jae °°? =A° 


1.e., y == o(x) satisfies the differential equation. Moreover, the initial 
condition ts satisfied, since 


O(Xo) == Fy ‘LF 2(¥o)] = Jo- 


This completes the proof. 


* By Fy’ is meant the inverse of the function F;. 
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Remark. If f(y) vanishes at some point y = ),, then uniqueness may be 
lost, depending on whether or not the integral 


("4 (2.11) 
‘v0 fo) 

converges as y approaches . If the integral converges, then infinitely many 
integral curves pass through certain points of the rectangle Q, and they are 
all tangent to the integral curve y = y, corresponding to f(y) = 0. On the 
other hand, if (2.11) diverges as + y, -: 0, there is always a unique integral 
curve passing through a given point (x9, Yo) of Q.° Of course, here we assume 
that f{(x) is not identically zero, since otherwise there will always be a unique 
integral curve passing through each point of Q. 


Problem I. Investigate the behavior of the integral curves of the following 
equations: 


dy sin xX dy sin y dy ie 
20 area ane 2. Mee Sener 5 c)— = 3/_: 
dx siny dx  sinx 4 i y 


I 


dy y dy sin x dy sin y 
d)— fe ye 8 f)— = 3 —~ | 
XxX Ld 


© dx NN siny sin x 
Problem 2. Consider the equation 


dy £0) 


dx a(x)’ 


where the functions 9(x) and /(y) are defined and continuous for all nonnegative 
x and y, and satisfy the conditions (0) = f(0) = 0. Suppose 9(x)fG)) < 0 in 
the domain G: 0 <x < 0, 0 <y< @. Investigate the behavior of the 
integral curves as x — «% and also as y — «©, depending on whether or not the 
following integrals converge: 


Se oe 
Jo 7G)" ‘1 fQ)" J a" H1 9x)” 


On the other hand, suppose 9(x)/(y) > 0 in G. Show that every integral curve 
passing through a point of G approaches the origin arbitrarily closely if the 
curve is continued in the direction of decreasing x (part of the curve may consist 
of boundary points of G). Find a criterion for the solution to have an asymptote. 
Next suppose that in addition to the above requirements, 9’(0) # 0, /"(0) # 
and ¢"(t), f(r) are continuous in some interval 0 < f < c. Prove that 
a) Every integral curve approaches the origin along a definite direction; 


* Uniqueness can omiy be lost if an integral curve y = 9(x) is tangent to the integral 
curve y = ¢ at an interior point (x, y,) of the reclungle Q. But this is impossible if (2.11) 
diverges as } — yy, since then the left-hand side of (2.9) becomes infinite as x -- x, while 
the right-hand side remains finite. 
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b) If %’(0) 4 (0), all the integral curves are tangent to one of the coordinate 
axes (which one?) for x = 0; 


c) If¢’(0) = f’(0), the integral curves approach the origin along all directions 
(as lines y = kz). 
Finally, make a similar analysis of the solutions of the equations 


dy — $(x) dy 
dx f(y)’ dx ay) 


under the same assumptions. 


Problem 3, Let the functions ¢(x) and f(y) be continuous and positive for 
a<x <b,c¢ <y <d, and suppose each function approaches zero or infinity 
at the end points of the interval on which it is defined. Study all possible con- 
figurations of integral curves of the equation y’ = ¢(x)f(y) in the rectangle 
a<x <b,c <y <d. Then use the result to make a detailed investigation of 
the integral curves of the equation 


y= PEAY) 
7 RESO)’ 


where P, Q, & and S are polynomials. 


6. Homogeneous Equations 


By a homogeneous differential equation we mean an equation of the form 


Suppose the function /(w) is defined in the strip @ <u < 5. Then the function 
F(/x) will be defined in the pair of angles consisting of the points (x, ¥) such 
that 


The domain consisting of these two angles will be denoted by G. 


THEOREM. Suppose f(t) is continuous and never equal to u in the 
interval a <u <b. Then there exists a unique interval curve of equation 
(2.12) passing through every point (Xo. Yo) of G. 

Proof. Setting y =: ux, we write (2.12) in the form 

xu’ -- ue == f(t). 
But then | 
du f(u) —u (2.13) 
dx x | 
which is a differential equation with separated variables, and the proof 
is completed by invoking the theorem of the preceding section. 
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It follows from (2.13) that 
dx dit 
. Saat 
and hence 


ln xi? a + C, (2.14) 


where (uw) is some primitive of the function 


1 
fluy— a 


We see from (2.14) that all the integral curves of a homogeneous differential 
equation are similar (in the geometric sense), with the origin as the center of 
similitude. In fact, there is clearly a choice of the constant C, such that the 
subsutution x — Cyx, y—» Cyy carries the curve 


In |x| == ® (2 
a 


into any given curve of the family (2.14). 

The exceptional case f(z) = u has 
already been considered in Example 
1, p. 6. If f(u) = uw at separate points 
Wi,..., U,, then several integral curves 
may pass through certain points (Xo, ¥9) 
in the domain G, if the integral 


sr A) ca 
converges as uw approaches one of the 
numbers u),...,,, say u,. The be- 
Ficure 5 havior of the integral curves in this case 
is shown schematically in Figure 5. 
For example, the integral curves AB,C,, 4B, B,C., AB ByCy, ... pass through 
the point A and are all tangent to the line y = u,x. 


Problem 1. Investigate the behavior of the integral curves of the following 
equations: 
dy dy dy y 
© dx ae dx ) x x 
Problem 2. Prove that if all the integral curves of a differential equation (2.1) 
are similar, with the origin as the center of similitude, then the equation ts 
homogeneous. 
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Problem 3. Suppose /(«) is a continuous function on 0 <u < my such that 
f(0) =0, fa) #u (0 <a < up). 


Study all possible configurations of integral curves of equation (2.12) in the 
sector 
y ly 
ge oe x => Q) 
0 pe fom, 7 (x Q). 

Then use the result to make a detailed investigation of the integral curves of the 
equation 

dy  P(sin 9, cos 9) 

dx  Q(sine, cos)’ 


where P and Q are polynomials in two arguments, and 9 is the polar angle. 


Problem 4. The most general homogencous equation defined on the whole 
xy-plane (except at the origin) is of the form 


a EP), (2.15) 


where F(9 + 2x) = F(p). Suppose F() can “go to infinity” in a sense like that 
of Sec. 2, but is otherwise defined and continuous. Under what circumstances 
can (2.15) be written in the form 

dy fil) 


—— 


dx fl?) 


where the functions /; and /, are continuous, finite everywhere and periodic 
with period 2x, and do not vanish simultaneously ? 


Problem 3. After transforming to polar coordinates, equation (2.15) of 

Prob. 4 becomes 

do 

— = P(o)s, (2.16) 
do 
where the function (9) has the same properties as the function F(9) of Prob. 4. 
Equation (2.16) is more convenient for studying solutions. In particular, suppose 
P(g) is finite everywhere. Describe the behavior of the integral curves when they 
are continued indefinitely in both directions, as it depends on the sign of the 
integral 


oO re 
~~ 


“ 0(9) de. 
0 


e 


What happens if the integral vanishes? 


Problem 6. Find the most general solution of the equation 


dy of IX POY Ae 

dx f Qx + by + cy ; 
making appropriate assumptions, and describe the behavior of the corresponding 
integral curves. 
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7. Linear Equations 


By a linear differential equation of the first order we mean an equation of 
the form 
dy . | 
— = a(x)y + d(x). (2.17) 
dx 
THEOREM. Suppose a(x) and B(x) are continuous fora<x<b. 
Then there exists a unique integral curve of equation (2.17) passing through 
each point (No. Vp) of the strip @-< xX <b, —0 <y < c and defined for 
all x in the interval (a, 6). 


Proof. First we consider the simplest case, where b(x) == 0. Then 
(2.17) reduces to the “homogeneous” linear equation® 
dy | 
— == a(x)y, (2.18) 
ax 
which is an equation with separated variables. Recalling the results of 
Sec. 5 and noting that 
“Yu dt; 


ee 4 


diverges as y > 0, we conclude that (2.18) has a unique solution passing 
through the point (xp, J). The fact that this solution is given by the 
formula 
aio en - 
y(X) = ¥q eXp is a(é) dé, (2.19) 
“OQ 
can be verified by inspection. 

We now solve the original “nonhomogeneous’’ equation (2.17) by 
the method of variation of constants. This is done by writing 


v(x) = 2(x) exp a a(Z) dé} ; (2.20) 


where z(x) is now a function of x, instead of a constant as in (2.19). 
An elementary calculation shows that (2.20) is a solution of (2.17) if'and 
only if z(x) is differentiable and satisfies the equation 


dz et ey ge 
— = h(x) exp Ee |. a(2) a) 
dx oa 


* This use of the word “homogeneous” is not to be confused with the different use of 
the same word in Sec. 6. 
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Moreover, }(X)) = ¥,) if and only if z(c,y) = yo. Therefore 


oe fs | 
z(X) = Yo + I; b(s) exp i? a(Z) dé, ds, 
and hence 


VCR) SS ZUR EXD, iM a(é) dz) 
== 9 exp Un. a(é) de +2 B(x) exp 1" az :) dé ds 


is the unique solution of (2.17) which reduces to yy for x = Xp. 


Problem J. Show thatifia =< 1 the equation 
ay 

—=alx)y + db(x)y" 

cx : : 


(Bernoulli's equation) can be reduced to a linear equation in z ay the substitution 
z = y*, for a suitable choice of k. (For odd n, assume that } > 0.) How does 
one solve Bernoulli's equation if 7 = 1? 


Problem 2 (QO. A. Oleinik). Let p(x), giv) and r(x) be continuous functions 
ona <x < & such that 


pla) = p(b) = 0, pix) >O0 (a<x <5), g(x) > 0 (a <x < 4), 
— =| —=+%n (O<2<b — a). 
Ja p(x) #b-= p(x) 


Prove that all solutions of the equation 


ae dx vob dx 


(x) giony =r 
P(X) ai +gx)y =r 


which exist on the interval @ <x <6 converge to r(d)/q(b) as x -- 6, Prove 
that one of these solutions converges to r(a)/q(a) as x -- a, while the others 
converge 10 4% Or —%, 


8. Exact Equations. Integrating Factors 
As already noted in Sec. 2, it is often convenient to write a differential 
equation in the form* 
M(x, y) dx -+ N(x, y) dy = 0. (2.21) 
If the left-hand side of (2.21) is an exact differential of some function of x and 


y, we call (2.21) an exaer differential equation. Suppose Af(x, y) and N(x, ¥) 
are Continuous and have continuous partial derivatives 0//@y and dN/éx on 


* Note that here V has a plus sign, unlike equation (1.3). 
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some domain G. Then, as we know from calculus, a necessary and sufficient 
condition for the left-hand side of (2.21) to be an exact differential is that 


ile cree (2.22) 


provided G is simply connected. 


THeorem. Let the functions M(x, vy). NG, ¥) and their partial 

derivatives @M/éy, ON/@x be continuous on a rec tangle Q:a<x<b, 

<y<d, and suppose N(x, y) is nonvanishing in Q and the condition 

(3 22) holds everywhere in Q. Then there exists a unique integral curve of 
equation (2.21) passing through each point (Xo. Vo) of Q0 


Proof. As already noted, there is a function z(x, ) defined on Q 
whose exact differential equals the left-hand side of (2. 21). This does not 
involve the assumption that NV =4 0. However, since V = 0, we can write 
(2.21) in the equivalent form 


M(x, vy) + Ny yy’ = 0, 


or 
i200] _ 
dx 
since 
M = ge . N= & ; 
Ox oy 


~ 


Therefore (x) is a solution of (2.21) if and only if 
z{x, y(x)] = C = const. (2.23) 


This equation can be satisfied by a curve passing through the point 
(Xq.¥o) only if C = z(xp. yy). But if C = zx, Yo), it follows from the 
implicit function theorem that (2.23) defines a unique curve passing 
through the point (x9, V9), as required. At the same time, we have proved 
that the desired solution is given by the equation 

2(%; ¥) oe =(Xo, Vo). (2.24) 
As we know from calculus, finding the function z(x, y) involves the 


evaluation of two integrals. 


* A domain G is said to be simply connected if whenever G contains a closed polygonal 
line L with no self-intersections, it also contains the interior of L (the set of all points 
inside L). 

* With these assumplions concerning Wf and N, it follows from equation (2.21) that 


dxidy is nons ‘anishing in Q. Therefore every integral curve of (2.21) lying in Q must be the 
graph of a function of x 
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Example. Suppose we are given a direction field defined by the equation 


Me 


en ee 
ad — = ) == xdx --+ ydy = 0 (2.25) 
on the domain between two squares with centers at the origin and sides 
parallel to the coordinate axes, of lengths 2 and 4 respectively, We cannot 
apply the above theorem directly to the whole domain G, since N(x, y) = y 
vanishes on the x-axis. However, we can apply the theorem separately to the 
four rectangles 


Oe 2 ee 2 Ty 2, 
Qe —2<axX<2,-2< yy < —l, 
O.. Ll<x<2,-2< y < 2, 


O,; —-2<x< —-l,-—2<y< 


to 


provided that for Q, and Q, the roles of x and y are reversed in the statement 
of the theorem. Then it pollows from all four results taken together that a 
unique integral curve of equation (2.25) passes through each point of the 
domain G. In fact, the curve is just a circle (or a circular “arc) with its center at 
the origin (cf. Example 2, p. 7). 


Even if equation (2.22) fails to hold, it is sometimes a comparatively 
simple matter to transform the differential equation into an exact equation. 
This is accomplished by using an integrating factor w(x, y), ie., a function of x 
and y such that the left- Rand side of (2.21) becomes an exact differential 
after being multiplied by u(x, ¥). Ifthe functions Af(x, 3), N(x, y) and u(x, y) 
have continuous parual derivatives, the integrating factor u(x, y) must satisfy 
the condition 


or more explicitly, 
Ou Ou i. i OM 
tL : 
¥ Ox LOX Gy 


This 1s a linear partial differential equation of the first order. To transform 
the left-hand side of (2.21) into an exact differential, we only need any par- 
ticular solution of (2.26).4° However, as we shall see in the Supplement. the 
problem of finding such a particular solution is in no way simpler than that of 
finding the general solution of (2.21). 

This concludes our discussion of elementary methods for finding solutions 
of first-order differential equations. For other methods, we refer the reader 


"The trivial solution uw —- 0 af (2.26) is obviously of no interest. 
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pecially those by 


to some of the books cited in the Bibliography on | 7 (es 
of reducing differen- 


Ince nnd Kamke). These me sthods usually have the oe 
tial equations to one of the forms considered above. 


Problem 1. ts the expression 


SUN dy 


_—_— 


a¥ 
xX bd “t= ur 


the exact differential of sore function in the domain considered in the example 
on p. 23? How about 
yax —xdy, 


*y 
vas 


x? + y 


Problem 2, Suppose equation (2.21) has continuously differentiable coefii- 
cients satisfying the condition (2.22) and defined on a simply connected domain. 
Show that if (2.21) has a closed integral curve, there is at least one point (4%, J) 
inside the curve such that 


M(Xo; Yo) = NQo: Yo) = 


Problem 3. Suppose the condition (2.22) is satisfied in an m-connected 
domain (1 = 2,3,...), ie. in a domain with “holes.” (Give a precise 
definition of m-connectedness.) Moreover, suppose the functions M(x, y) and 
N(x, y) are continuously differentiable and do not vanish simultaneously. 
Prove that under these conditions the equation of an integral curve passing 
through a point (x, +4) of the domain can be written in the form (2,24), where 
however, the function z(x, y) is not single-valued but is defined only to within a 
term of the form 

Hi Cy ae? i Gas 


Here Cy...., Cy, are certain completely determined constants (called 
Aaa While #,,...,%,-, are arbitrary integers. Under what conditions 
will the function z(x, y) be single-valued ? What happens if a1 = 2? Whatis the 
aeeace between the case of commensurate periods and the case of incom- 


mensurate periods if m > 27 What happens if the domain is ‘infinitely con- 
nected?" 


Problem 4. Suppose the left-hand side of equation (2.21) becomes the exact 
Gifierential of a function z(x, y) after being multiplied by a suitable integrating 
factor u(x, ¥). Show that the function uf[z(x, y)], where /(z) is an arbitrary 
continuous function of :, is also an integrating factor of (2.21). Show that ifall 
relevant functions are continuous and if 


[AT(X, Yoh + IN CO%q, Ho) > 0, U(X, Vo) ¥ 0, 


then any integrating factor of equation (2.21) insome neighbor hood of the point 


(Xp, J) can be represented in this form. (However, this 1ast assertion is in general 
false for the domain as a whole.) 
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Problem 5. Let z,(x, y) and z,(x, ¥) be continuously differentiable functions 
on a domain G such that 


dz; oz, 
az a Oz, ax Oy 
——| -+ }——| > 0, . ' ne oe 
Ox | Oy OZ, 924 

Ox ay 


Prove that zy is a function of z,; In some neighborhood of any point of G (but 
not necessarily in the whole domain G!). 


Problem 6, Find an integrating factor for the linear equation written in the 
form 
dy — fa(x)y + b(x)] dx = 0. 


Problem 7. Let M(x, y) and N(x, y) be twice continuously differentiable 
functions ina rectangle Q, where N # 0. Prove that a necessary and sufficient 
condition for the existence in Q of a continuous integrating factor » + 0 [of 
equation (2.21)] depending only on » is that 


v( BN  BAL\ aN fe aM 
ed eae 


h 


in QO, 
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GENERAL THEORY 


There are relatively few differential equations whose integrals can be 
found by elementary methods. In fact, as shown by Liouville in 1841, even the 
simple Riccat! equation 

dy 0 
— = a(x)" + ay(x)y + a(x) 

dx 
cannot be solved by guadratures, i.e., by subjecting known functions to a 
finite number of algebraic operations and integrations (as was done in Secs. 
3-8).! Therefore a problem of great importance is to develop methods for 
approximate solution of differential equations which are applicable to large 
classes of equations. But before trying to find approximate solutions of an 
equation, we must make sure that the equation has solutions in the first place! 
The first part of this chapter is devoted to existence questions of this sort. 
The way in which existence theorems are proved will often suggest methods for 
approximate determination of solutions (see e.g., Secs. 9, 13, 14 and 17). 


9. Euler Lines 


Suppose we are given a first-order differential equation 
y == f(x, y), (3.1) 
where f(x, y) is defined on a domain G. As we know, equation (3.1) defines a 
direction held in G, which in turn has certain integral curves. Choosing any 
point (Xp, yo) in G, we draw a line segment through (xy, yg) with slope f (Xo Jo): 


; cr 1 bs b * : ale 
Concerning the problem of solution by quadratures, see e.g., I. Kaplansky, da 
{Introduction to Differential Algebra, Hermann et Cic., Paris (1957). 
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Then on this segment we choose another point (x, y,) in G, indicated by the 
number | in Figure 6. Through (x,, y,) we draw a line segment with slope 
[(%p 1), and then on this new segment 
we choose a new point (x., v2) in G, 
indicated by the number 2 in the figure. 
Suppose we repeat this construction in- 
definitely, and let the points be such that 


‘ en ” — 
Vg <— AE om Xa Ti fee oe 


(the construction can also be carried out 
in the direction of decreasing x). Then 
the resulting polygonal line will be called 
an Euler fine. 

[t is natural to expect every Euler line passing through (x, ¥9) to resemble 
an integral curve passing through (xp, ¥,), if such an integral curve exists and 
if the segments of the Euler line are sufficiently short. Moreover, we might 
expect the Euler line to approach an integral curve as the length of its segments 
goes to zero, and it will be shown later that convergence of this sort actually 
occurs if f(x, y) is continuous. However, in general there will be no uniqueness, 
1.e., In general there will be several integral curves passing through the same 
point (Ng, Vo). In fact. Lavrentev has constructed a differential equation of the 
form (3.1) such that f(x, ¥) is continuous and at least two integral curves 
pass through any given point (%, V9) of G, in every neighborhood of (Xp, }9).” 
Thus we must assume more than continuity of f(x, ¥) if only one integral 
curve is to pass through (Xp, }'9). 

In Sec. 11 we shall show that continuity of f(x, y) implies the existence of 
an integral curve of equation (3.1) passing through any point (9, Vo) in G. 
The proof is due to Peano, and is based on the use of Arzela’s theorem..It is 
clear that every such integral curve is the graph of some solution of the differ- 
ential equation (3.1)..¢. © 


FIGURE 6 


Problem. Let f(x, y) be continuous and bounded on the strip 


agxca’, —-ws <y< wo (a <a’). 


Show that the set of all Euler lines of equation (3.1) drawn from a point (a, 0) 
completely fills the region R bounded from above by the curve y = 9,(4), 
a< x <a’, from below by the curve y = 9.(x), @ < x < a’ and from the right 
by the line.x = a’, where part orallof the curves y = 9,(%), » = 9e(%) may belong 
to &. Here it is assumed that 
a) y = 9,Cx) is concave downward and y = ga(x) Is concave upward; 
b) ela) = e2(a) = 6, 9j(a) = 9,(a) = f(a, 8); 
c) The left and right-hand derivatives of ¢,(x) are not exceeded by f[x, 21:0)], 
while the left and right-hand derivatives of g(x) do not exceed f[x, Qa(x)]. 
7M. A. Lavrentev, Sur une equation differentielle du premier ordre, Math, Z., 23, 197 
(1925). 
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10. Arzela’s Theorem 


Tueorem. Let {f(x)} be an infinite family of uniformly bounded, 
equicontinuous functions defined on a finite tnterval (a, 6) Then f f(x} 
contains a uniformly convergent infinite subsequence. 

Proof The uniform boundedness means that there exists a constant 
Af such that ‘ bai 
Mf such if@| <M 
for every f(x) in the family and every x in (a, 6). The equicontinuity 
means that given any ¢ > 0, there exists a number q = 4(¢) > Osuch that 
Ix" — x"] < 4 implies 

FO) — FO) <« 
for every f(x) in the family and every x’, x” in (a, 6). 

Turning to the proof itself, we note that because of the uniform 
boundedness, the graph of every function in the family {/(x)} lies in the 
rectangle 4BCD shown in Figure 7, with sides 24/4 and 5 - a, Consider 
the infinite secuence 


_oM M M 
“i atl =2.0o ya+2 ) > Sp yak ’ . 


Where « Is any nonnegative integer, and let 4, = 7(e,) be the number 
associated with ¢, in the definition of equicontinuity. Then partition the 
rectangle ABCD into subrectangles. 


a Ol a. C by drawing a family of horizontal 
ae ae a } lines including the x-axis a distance 

: ontHey 7 | of | , =, apart, and a family of vertical 
ol «i mie ‘ a7) Tt lines including the lines x = a and 
| n/a a x=ba distance aT apart. Let 
; fot? rd the corresponding vertical strips be 

E Weds ee 9 labelled consecutively by Roman 
1 | | | numerals I, If,..., as shown in 
‘ D Figure 7. Since jx’ — x"| <4 


FIGURE 7 


implies | f(x") — f(x")| < , no 
function in the family {f(x} can 
have a graph lying in more than two adjacent rectangles belonging toa 
given strip, in particular to strip T. Therefore there is at least one pair of 
adjacent rectangles in strip I, e.g., the shaded pair in Figure 7 (where 
% = |), containing the graphs of infinitely many functions in { f(x)}. After 
leaving Strip I, tne graphs of all the functions in this infinite subfamily 
of if (x)} Can only pass through four rectangles in strip H, while the 


< a * a ’ ; 
. It does not matter whether the interval is o 
interval contains its end points, 
4 by i. "2 + ; 
Suggested by L, A, Lyusternik, 


pen or closed, ie., whether or not the 
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graph of any given function in the subfamily can only pass through 
two neighboring rectangles in strip I]. As before, there must be two 
neighboring rectangles in strip II, also shaded in Figure 7, containing the 
graphs of infinitely many functions in {f(x)}, each of which falls entirely 
in the shaded rectangles in strip I as well. Continuing this construction, 
we finally construct an entire band of width 22), covering the whole 
interval (a, 6) and containing the graphs of infinitely many functions in 
the family (f(x)}. This band 6, is shaded in Figure 7. Let f#(x) be any 
function in 4,, and let {/,(x)} denote the infinite family consisting of 
the remaining functions with graphs in 6,. 

Next we treat the family {/{(«)} in the same way as the family 
{f(x)}, except that we now choose <, instead of <, and %» instead of 7). 
This leads to a band 6, of width 22, containing the graphs of infinitely 
many functions in {f,(x)}. Let /2(x) denote any of these functions, and 
let { fo(x)} denote the family consisting of the remaining functions with 
graphs in b,. Repeating this construction indefinitely, we obtain an 
infinite sequence of functions 


on : a ae 

PONS nas CO iexiorss (3.2) 

where the graphs of all the functions starting from f%(x) lie in a band 6, 

of width 4f/2***-1, Therefore the sequence (3.2) is uniformly convergent, 
and the theorem is proved. 


Problem I. Give examples showing that the uniform boundedness and the 
equicontinuity are both essential conditions in ihe statement of Arzela’s 
theorem, 1e., show that the conclusion of Arzela’s theorem may fail if either 
condition is dropped. 


Problent 2. State and prove Arzela’s theorem for functions of several 
independent varlables. 

Problem 3. Does Arzela’s theorem remain true if we allow the interval (a, 5) 
to be infinite? 


Problem 4. Show that Arzela’s theorem remains true tf instead of uniform 
boundedness on the whole interval (a, 6), we require only that [f(%)| < Af 
for every f(x) in the family and some xp in (a, 4). 


if. Peano’s Existence Theorem 
THEOREM (Peano). Let f(x. 1) be bounded and continuous on a domain 
G. Then at least one integral curve of the differential equation 
dy vee 
== 231 (X59) (3.3) 


dx 


passes through each point (Xo, Vo) of G. 
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Proof. Assuming that [f(, yl < M, draw the lines with slopes Af 
and —Af through the point (Xp, 4) in G. Then draw the ae X = aand 
x= h(a <x) < 4) parallel to the y-axis, which together with the first 
nwo lines form two isosceles triangles contained in G with common 
vertex (Xq, Vo). aS Shown in Figure 
8. Next, using the method of Sec, 
9, construct an infinite sequence 
of Euler lines 


ae dens ee ey ae + 


passing through the point (x, y,) 

such that the length of the largest 

segment of ZL, approaches zero as 

Piedaes k —» oo. Every polygonal line L, 

(k = 1,2,...) intersects any line 

parallel to the y-axis only once, and hence is the graph of some contin- 
wous function of x, Which we denote by ¢,(x). The functions 


2,(x), G(X), et ee eC), 


have the following properties: 


1, They are all defined on the finite closed interval fa, 5]. In fact, 
o,(x) can only fail to be defined on [a, &] if the corresponding 
Euler line L, leaves the domain G at some interior point of [a, 4]. 
But this is impossible, since the slopes of the segments of L,. are 
less than Af in absolute value, and hence L, cannot leave the ur- 
angles ABC and ADE through the sides BE and DC. 


. They are uniformly bounded on [a, 6]. since all their graphs are 
contained in the triangles ABC and 4 DE. 


to 


od 


. They are equicontinuous on [a, b], since obviously 


lo,(x") — 9,(¥')| < M |x” — x"| 


for every K = 1, 2,... and every x’, x” in [a, 6). 


re 


Thus the sequence (eal )j meets all the requirements of Arzela’s 
theorem, and hence contains a subsequence 


ga). oI), 2.5 9G), 
Which converges uniformly on the whole closed interval [a, 6] to a limit 
function 


o(x) = lim +(x), 


| ends # 


Where ¢(x) obviously satisfies the initial condition o(Xy) = Yo 
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Next we show that the function (x) satisfies the differential equation 
(3.3) in the interval (xq, 6) (a similar argument is applicable to the interval 
(a, X¥4)]. Choosing an arbitrary point x’ in (x9, 5), we shall prove that 


A= A) _ Fy eer (3.4) 


for arbitrary = > 0, provided that [x” — x’] is sufficiently small. To 
establish (3.4), it is sufficient to prove that 


ra rae ot 
pana bek (x’) 3.4") 


" 
—, 
. 


— fix’, o(x')] | i 


for sufficiently small |x” — x’| and sufficiently large 4, since 


ox) ox"), gx") > of") 


as k — oo, Since f(x, y) is continuous on the domain G, given any = > 0, 
there is an 7, > 0 such that 


few) Se fay <I se 


where y' = (x’),° provided that 


“% 


lx — x] < 2% [1 


For sufficiently small , the set of 
points (x, y) in G satisfying these 
inequalities is a rectangle Q (see 
Figure 9). Now choose X so large 
that k > K implies that all the 
segments of £, are shorter than y 
and that 


lox) — gO) < My 


poe ee 


FIGURE 9 


for all x in [a, 6). Then every Euler 
line y = 9)(x) lies entirely in Q if k > K and |x — x'| < 2% (why?). 
Moreover, if |x" — x’| <4, we have® 


LPO vy) — el” — x’) < ol (x7) — of (x) < [ fi) + s]Q@" -— ¥), 


which implies (3.4). This completes the proof. 


* Here it is important not to confuse y’ with the derivative dyfdx. 

° We chose Q to be the rectangle such that |x — x’| <2y instead of simply [x — 3°] < % 
in order to be able to estimate ihe a of the extreme left-hand segment of an Euler Hine 
lying in the strip S: x’ ~ 4 < v. SX + a The ae point of this segment may not He in 
S, but it will certainly lie in the Strip.x 24 SX x, 
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Remark 1. The same argument appl lied to the interval (xq, b) with x’ = x, 
shows that the right- hand deriv: ative of o(x) at X = NXg, Le., 


P(X") = (Xo) 
is. eres lg 


7 ‘ 
xo 3* Sat ( NX ey Ny 


equals a [xp. o(Xp)]. Similarly, app’ lying the argument to the interval (x, 5) 
with x’ = 8, we see that the left-hand derivative of o(x) at x = 4, ie., 
a(x") — 9(b) 


[ets 
wera xX" —b 


equals fb. o(b)). Similar assertions hold for the end points of the interval 
(a, Xo). 


Remark 2. The above construction leads to a function 9(x) equal to Vo 
for X = Xp Which satisfies equation (3.3). but only on the closed interval [a, 5]. 
Consider one of the end points of the resulting are y = (4), say U the right- 
hand end point P. By construction P les in G, and hence we can repeat “the 
whole argument, choosing P instead of (Xo, Vy) and constructing Euler lines 
to the right of P. In this way, we are able to continue the integral curve up toa 
point P;, say. Again we can draw Euler lines through P,, extending the integral 
curve further to the right, and so on. This construction leads to integral 
curves which come arbitrarily close to the boundary of G. In fact, the resulting 
“chain’’ of isosceles tangles of the type ABC and ADE must “ultimately 
shrink to zero,”” since otherwise the chain would contain infinitely many 
triangles with lateral side length greater than some number <¢ > 0. But then 
we could construct an integral curve covering an arbitrarily large interval of 
the x-axis, contrary to the assumption that G is bounded. This, together with 
the fact that sides of triangles like ABC and ADE can come arbitrarily close 
to the boundary of G, proves our assertion, except for details (see Problem 2). 


Remark 3, Peano’s theorem remains valid even if f(x, y’) is only continu- 
ous on G, since every such function is bounded in any domain G’ whose 
closure is contained in G. Using this fact, we can extend Remark 2 to the case 
of pou one ‘e Thus, given a solution of the ee equation (3. 3) 
on theintervala <x < b, there exists animtervala’ <x <b" (a <a,b <b) 
and a solution y = ee of (3.3) on this interval coinciding with the given 
solution on @< x < 6 such that one of the following three possibilities 
occurs: 

a) Do = +10; 

b) 19(2)| ~+ +00 as x -- b! — 0; 


c) The distance between the point (x, o(x)) and the nearest point of the 
boundary of G approaches zero as x —> b' — 0. 


Phe situation is the same at the left-hand end point. 
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Problem I, Give a detailed proof of Remark 2. 

Hint, In constructing the chain of isosceles triangles, let the lateral side 
length of the “next triangle” equal one half the distance between the “preceding 
triangle” and the boundary of the domain G. Prove that the resulting sequence 
of vertices converges to a boundary point of G. 


Problem 2, Give a detailed proof of Remark 3. 


Problem 3. Suppose the domain G its a strip @ <x <a’, and let f(x, y) be 
continuous and bounded on G. It is possible that more than one integral curve 
of equation (3.3) passes through a given point (xg, 9) of G. Prove that there 
are two integral curves y = 9,(x) and y = 9(x), the maximum and minimum 
solutions (in the sense of Montel) such that 

a) O(%q) = 2%) = Vor Pol) < (x), @ <x <a’; 

b) The entire part of the strip between y = 9,(x) and y = gofx) can be 

completely filled by integral curves passing through (xp, yo); 

c) There are no integral curves passing through (xg, yo) which lie outside 

this part of the strip. 

Problem 4. Prove the existence of solutions of equation (3.3) by the following 
method, due to Perron: By an upper function on the interval (x9, b) [see Figure 8] 


is meant any continuously differentiable function y = ¢(x) whose graph lies 
entirely in G and which is such that 


9(%) = So 2'(x) > SL, (39), Ng ee ee 
Prove that 
a) Upper functions exist; 
b) The greatest lower bound of all upper functions is an integral curve of 


equation (3.2) passing through the point (vp, yy) [in fact, the maximum 
solution of Prob. 3]. 


Similarly, one could define fower functions and take their least upper bound. 
The same thing can be done for (a, xp). 


Problem 5. Let f(x, y) and F(x, y) be two bounded functions defined on a 
domain G such that F(x, ») > f(x,y) everywhere on G. Suppose F(x, y) ts 
upper semicontinuous, while f(x, y) is lower semicontinuous, 1.€., suppose that 


F(x, y) = limsup F(, 5), 
| a 
ee sx. y) = lim inf f(s, 5). 


Prove that there is at least one curve y = 9(x) passing through each point 
(Xo, Ko) Of G such that as Ax —- 0 all the limiting values of the ratio 
a(x + Ax) — ¢(x) 
Ax 
lie between f(xy, y) and F(x, ¥), where x is an arbitrary point of the curve. In 
general, there are many such curves passing through each point (xp, Vy), Including 
a maximum and a minimum curve in the sense of Prob. 3. 
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[2. Osgood’s Uniqueness Theorem 


THEOREM (Osgood). Suppose the function f (x, y) satisfies the condition 


IF (%, Yo) — fO YI < Pl ¥2 — Vid (3.5) 


for every pair of points (x, y;), (X, Ye) in a domain G, where o(u) > 0 is a 
continuous function on 0 <u < aand 


lim | -——= =o 

et0+erE o(u) 
Then there is no more than one solution of the equation y’ = f(x, y) 
passing through each point (Xo, Yo) of G. 


Proof. First we note that the functions 
Ku, Ku |In ul, Ku |In ul 1n [in ul, Ku [In u| In [Inu] -InIn|Inul,..., 


where K is a positive constant, all have the properties required of ¢(w). 
The most common choice is (uw) = Ku. In this case, (3.5) becomes 


| f, ye) — f(x, Vy) | —< K |ye — Val, 


and is known as a Lipschitz condition.’ 
Turning to the proof itself, suppose there are two distinct solutions 
yi(x) and y.(x) such that 


Vi(Xo) = ValXo) = Yo- (3.6) 


It can be assumed that x) = 0, since this can always be achieved by 
replacing x by x + X,. Now let 


Yalx) — yi(x) = 2(%). 
Since y(x) 4 y,(x), there is a point x, such that z(x,) 40. We can 
assume that z(x,) > 0, since the opposite case can be handled merely 
by setting z(x) equal to y,(x) — y.(x) instead of y.(x) — yi(x). Similarly, 
there is no loss of generality in assuming that x, > 0, since otherwise 
we need only replace x by —x. If |v, — y,| > 0, the derivative of z 
satisfies the inequality 


dz d — 
a = arate Heyy 7s Son) = = A. 


(3.7) 


7 Let G be convex in y, which means that every line segment AB parallel to the y-axis 
with end points in G lies entirely in G. Moreover, suppose the partial derivative ¢f/dy is 
bounded everywhere in G. Then f satisfies a Lipschitz condition. In fact, by the law of the 
mean, 


| fx, Ye) — f(x, yi) cs | fob, Via Oye ia yo] [Ve oe yl << K\ys a yal, 


where K is the least upper bound of | f,| in G. 


SEC. 12 GENERAL THEORY 35 


Next we construct the solution y(x) of the equation 


= 20(y), 
dx 


satisfying the condition y(x,) = 2(x,) = z,. Such a solution exists and is 
unique (sec Sec. 4), and its graph approaches the negative x-axis 
asymptotically without ever intersect- 
ing the x-axis (sce Figure 10). By 
construction, the two curves z(x) and 
y(x) intersect at the point (4, z,), and 
the inequality 
2'(x1) < 29(z,) = 29[}(%)] = ym) 
immediately implies the existence of 
an interval (x, — ¢,.x,),¢ => Oonwhich 
2(x) > y(x). 
But this inequality holds for every ¢, if 0 < << x, since otherwise we 
immediately arrive at a contradiction by choosing = to be the largest 
value compatible with the inequality. In fact, we would then have 
2'(X2) > Ye) = 29[Ge)] = 2¢[2(%2)], 
since z2(x) > (x) to the right of the point x, = x, — 2, while on the other 
hand, since z(x.) > 0, (3.7) implies the contradictory inequality 


z'(xs) < 2o[z(x.)]. 


ys 


Ficure 10 


It follows that 

(x) > yx) > 0 
for all x in the interval (0, x,]. In particular. (0) = 2(%)) > 0, contrary 
to (3.6). Thus there cannot be two distinct solutions of the equation 
y = f(x, vy). and the proof is complete. 

Problem J. Prove that if ¢(0) — 0, 9’(0) = 0 and the domain G is convex, 
then a function f(x, y) satisfying the condition (3.5) must be independent of y. 
In particular, this is the case if o() = u?, p > I. 

Problem 2, Prove that if the graph of ¢(u) is concave downward, then diver- 
gence of the integral 

du 
0 (4) 


is not only sufficient but also necessary for validity of Osgood’s theorem. 


Problem 3. Prove that if o(0) = 0 and ¢’(Q) exists, then 


ere 


Jo o(u) - 


“y 


re dt 
| € 


Le., the function o(s) satisfies the conditions af Osgood’s theorem. 
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Problem 4. Suppose the function a(v) figuring in Osgood's theorem is 
chosen in turn to be Ku, Ku|inal, Au [Inulin |iInal,..., and so on, The 
corresponding progressively weaker restrictions on the function I(x, y) lead to 
progressively stronger theorems. Prove that there is no “strongest” theorem of 
this type. In other words, show that if o(t) satisfies the conditions of Osgood's 
theorem, then there always exists another function ¢,(1) satisfying the same 


conditions such that 


61 deoapame 
une qu) 

Problem 3. As we know from analysis, a function f(x, y) defined ona domain 
G can be. continuous in each variable x and y separately, without being con- 
tinuous in both variables jointly. Prove that if fis continuous in x and satisfies 
the condition (3.5), where 

lim o(t) == 0, 

tu~e } 
then fis continuous on G in both x and }. Is this true for functions defined ona 
square, a disk or a triangle, including its boundary? 

Problem 6, Prove that if @f/éy is continuous on a domain G, then there is a 
unique solution of »” = f(x, y) passing through cach point of G. Is mere existence 
of this derivative enough? 

Problem 7. Let f(x, ¥) be continuous on a domain G, and let y(x), zx) and 
a(x) be continuously differentiable on an interval xy <x» <b, Moreover, let 


¥(X%q) = 20%) = UlXg) = Fo, 
where (Xq, 39) iS a point in G, and let 
YO) =v, "Q)>SO.). we) > fo, 0) 


everywhere in xy < x < b. Prove that z(x) > ‘y(x) for all x > xg. Assuming in 
addition that y(x) is the unique solution of y’ = f(x, y) On Xp xX <b, prove 
that w(x) > v(x) for all x > xp. Is this still true if the uniqueness requirement on 
3G) is dropped? Extend the above assertions to the case where it is only required 
that v(x) and z(x) be continuous with right-hand (or left-hand) derivatives. 


Problem 8. Suppose f(x, y) is continuous on the strip a <x <b, -2” < 
<= % and satisfies the estimate [f(x, y)] <= o(|y]), where o(z) > 0 is continuous 
on0Q <2: < «% and 


~ 


ue pa f mw} =. 
ae o(z) 


: pear s MP 7 ‘Pe Lat « 7 a = : is ve - a af “ 
for every N > 0. Prove that a solution sauisfying arbitrary iniual conditions 
can be continued onto the whole interval a < x < b. [This is also true if (x) > 
0.] Give an example of a continuous function o(z) > 0 such that 

ro. dz 


Jo oz) 
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but 
ie dz 
mmeeerrerer: ah. ID 


0 gz) +e 


for arbitrary « > 0. What does this imply about continuation of solutions? Is 
such an example possible if o(z) is monotonic? 


Problem 9. Let y = Y(x),a < x < 6 be the maximum solution (in the sense 
of Prob. 3, Sec. 11) of the differential equation y’ = f(x, y) with continuous 
{(x, y) satisfying the initial conditions x = x9, ¥(x9) = jy» [the curve y = Y(x) 
lies entirely in G]. Moreover, let Y,,(¢) be the maximum solution of y’ = f(x, y) 
satisfying the initial conditions x = x,, Y,(x,) = yn, where y, > ¥(x,) and 

HM Xe = XG lim yn = Yo. 
Ti+ MNwe GG 
Show that 
a) Y,,(x) exists on the whole interval [a@, 6] for sufficiently large n; 


b) Asa -— , Y,(x) converges uniformly on [a, b] to Y(x). 
Problem 10. Let y,(x) be the solution of the equation 


dy 
—— = f(x, y) + ea(x, y) 


satisfying the initial condition y(x9) = Yo, where 9,(x, y) > 0 and the least 
upper bound of 9,(x, 7) on G converges to zero as 1 —- «©. Here the functions 
{@, y) and 9,(x, y) are continuous, and the point (x9, yp) belongs to G. Let 
y = ¥(x) be the same as tn the preceding problem. Prove that y,(+) converges 
uniformly to ¥’(x) on the interval [x9, b] asa — x. 


Problem 11, Prove a more general uniqueness theorem, replacing (3.5) by 
the inequality 
Lf ¥2) — fx, yD < vey, — yD, 


which holds everywhere except at a finite number of values of x (however, this 
stipulation can be significantly generalized). Here 9 is the same as in Osgood's 
theorem, while 4 is continuous except possibly at the indicated values of x and 


“b 
Ux)dxy << x 


- 


on any finite interval (a, 6), Is the last condition essential? 


13. More on Euler Lines 


THEOREM. Suppose there exists a unique solution (x) of the equation 
v= f(x,y), with continuous f(x,y), and let Ly...., £y,... be any 
sequence of Euler lines drawn from (Xq. Vo) such that |L,|, the length of the 
largest segment of L,, approaches zero as k —» «. Then as k —> © the 


sequence of fictions o,(x),.... 4%), 2... with graphs Ly... Lay... 


38 = GENERAL THEORY eain.4 


converges uniformly to (x) on the closed interval [a, b} figuring in the 
proof of Peano’s existence theorem, 


Proof. It is obviously sufficient to prove that given any ¢ > 0, the 
sequence L,,..., Ly... . contains only finitely many Euler lines which 
ae not lie entirely between the curves y = @(x) — ¢ and y = (x) + 
a << x <b. Suppose this is not the case. Then there is an infinite 
sequence of Euler lines Z,,....£;,,... passing through the point 
(Xg, Vo). Where [L,] > 0 as k — o&, none of which lies entirely between 
the curves y = o(x) + c. But then the same construction as used in 
the proof of Peano’s theorem leads to a sequence of Euler lines uniformly 
convergent to an integral curve which passes through (X9, }o) and is 
distinct from the curve y = 9(x). The fact that this is impossible, 
because of the assumed uniqueness of o(x), establishes the theorem by 
contradiction. 


Problem I, Give an example of an equation y’ = f(x, y) with continuous 
f(x, ¥) and a corresponding sequence of Euler lines L),...,£,,... passing 
through the point (x, V9) [where [£,| -- 0 as A -- 2] such that the sequence of 
functions 9,(x),...,.9,(4),... with graphs £,,...,£;,... converges at no 
value of x other than x. 


Problem 2. Give an example of an cquation y’ = f(x, y) with continuous 
f(x,y) such that the limit of a sequence of Euler lines Ly,...,0;,,... beginning 
at any point (where |L,] --Oas k -- 0) always exists and is unique, despite the 
fact that more than one Integral curve passes through certain points of the 
domain, This possibility stems from the fact that a solution of y’ = f(x, y) 
passing through (v9, Yo) need not be the limit of a sequence of Euler lines drawn 
from (Xy, Yo). 


l4. The Method of Successive Approximations 


THEOREM. Suppose f(x, y) is continuous in x on a domain G and 
satisfies a Lipschitz condition 


L/ (x, ¥e) — f(x, y)| < XK [ye a Pail 


in y on every Closed bounded domain G' contained in G (the constant K 
may depend on the choice of G').® Then, given any point (Xg. Vy) in G, there 


* It tollows from these assumptions that fis continuous in x and y jointly on G. In fact, 
PO Ya) — fn vl < Ufa ved — fOr vi) + far) — fr va 
< KU ys yy] + [fra vi) — fOr, vd, 


Where both terms on the right can be made arbitrarily small for (x,, 2) sufficiently close 
to (x, ¥:) [the second term because of the continuity of fin x). 
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is a closed interval {a, 5] containing xy as an interior point on which the 
differential equation y= f(x, y) has a unique solution y(x) equal to y, for 
= Ny: 


Proof. First we note that if such a solution y(x) exists, then integra- 
lion of the identity 


y'(4) = fl. ¥(8)] 


between x, and x gives the integral equation® 


yx) = vo + |) SE y@) ae. (3.8) 


Thus every solution of the differential equation y’ = f(x, y) passing 
through the point (Xp, ¥o) satisfies (3.8). Conversely, every continuous 
solution y(x) of the integral equation (3.8) satisfies the differential 
equation and the initial condition y(%y) = yo-!° In fact, the initial con- 
dition is obviously satisfied by the left-hand side of (3.8), and moreover 
differentiation of both sides of (3.8) leads at once to the differential 
equation y == f(x, ), provided the differentiation is legitimate. But the 
right-hand side of (3.8) is clearly differentiable for continuous (4) 
[being an indefinite integral of a continuous function], and hence so is 
the left-hand side. Therefore, instead of proving that the differential 
equation y’ = f(x, y) has a unique solution equal to yy for x = x, on 
some closed interval [a, 0] (@ < x) < 6), we shall now prove that the 
integral equation (3.8) has a unique continuous solution on [a, 5]. 

Now let G’ be any closed bounded domain contained in G with 
(Xo, ¥o) aS an interior point, and let M be the least upper bound of 
f(x, y)| on G’. As in Figure 8, p. 30, we draw two lines DC and EB 
with slopes Af and — AM through the point (Xo, ¥o) and then two lines 
x = aand x = 5 parallel to the y-axis which together with DC and EB 
form two isosceles triangles ABC and ADE comp letely contained in G’, 
with common vertex (X9, Vo) [later in the proof, we shall assume that a 
and 6 are sufficiently close to xX]. Let 99(x) be a continuous function on 
[a, b] whose graph lies in G’, but which is otherwise completely arbitrary. 
Substituting @)(x) into the right-hand side of (3.8), we obtain a new 
function 


= 


; cary ~ - 
o(x) = vo + ie [Z, o9(S)] de, ae eb; 


It ts clear that ©,(x) is itself defined and continuous on [a, 6], and in 
addition satisfies the initial condition (x9) = Ve. Moreover, the 


*So called because the unknown function appears in the integrand. Note that the 
integrand is a continuous function of & [use footnote § and the fact that 1(4) is continuous, 
being ditferentiable]. 

*© We confine ourselves to continuous solutions of (3.8) only to avoid certain difficulties 
associated with integration of discontinuous functions. 
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graph of 91(%) cannot leave the triangles ABC and ADE, and hence 
cannot leave G’, since 


| TE, @o(E) << AL 
implies | 
lox) — yol < MM |x — Ol. 
Next we set 
Be pee cane 
@a(X) = Yor | ae (E, pils)] 46, 


where the properties of 9,(x) guarantee the existence of the integral on 
the right. The function ¢(%) is also defined on [a, 6], and has the same 
properties as 9,(X). Similarly, we construct further functions 


eae) = Yo +]. FE eo(E)) a, 


er (3.9) 


We now show that the sequence 
9o(%), a(x), eats 0,(X); ee (3.10) 


constructed in this way converges uniformly on [a, 6] to a continuous 
solution of the integral (3.8). For this reason, the functions (3.10) will 
be called successive approximations to the solution of (3.8).1! Since 
©, (x) can be written in the form 

@alX) = ox) + [ei(x) — GolX)] +++ + [aG) — 920); 


it is enough to prove the uniform convergence of the infinite series 


Q(x) or [9:(x) _ G(%)] aie ge oP? ie - Oy -a(X)] Bree (3.11) 
With this in mind, we use the Lipschitz condition to estimate @y.4(x) — 
On x): 


Ppl) a 0,(X)| cs 


FF ben) — en a@ a 
< K(b — a) max |9,(x) — @ a(x) (3-12) 


ace i 


Therefore 
lPol + [91 — Gol + ie — Pal + lea — gel + - 
<M+M+Mm+Mm+-:--, 


'* The method of successive approximations is due to Picard, and can be used to solve a 
great variety of mathematical problems. 
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where K(b — a) = m, and M,, Af, are such that 


|eo(x)] < Ma, |o,(x)| < M, 
and hence 


This numerical series converges for m< 1. Therefore if we choose 
[a,b] so small that A(6 — a) = m <1, the series (3.11) is uniformly 
convergent on [a, 6] to a continuous function 9(x) whose graph lies 
inside the triangles ABC and ADE. Therefore the integral 


iz 5 et ad 
|” SE, (2) dé 


~ To 


is meaningful. Moreover, since 


ie: ¥ bel bal “ “ e- os | 
1 UTS 9) — STS Pals} de 1 lo) — o,-1(8)| de | 


we can pass to the limit #7 —» oo in the relation 


Onl) = Yo +] FE, na] a2 


(cf. (3.9)], thereby proving that ox) satisfies the integral equation (3.8), 
as required, 

We must still prove that (3.8) has no more than one continuous 
solution on [a, b].!* Suppose to the contrary that (3.8) has two distinct 
continuous solutions ¢(x) and (x) on [a, b]. Then 


3 


Ax) = yot] (EO SO) = yo +] SE ede, 


and hence 


a . 
he 


le, ol 2) oar a 9 ols ait ie 
K(b — a) es cls a(x) — o(x)I. 


avers 


ing, 


It follows that 


Bue: |o(x) — o(x)| < K(b — a) max |o(x) — (x). 


wtb aearch 


But ioe can only happen if 


max |o(x) — o(x)| = 0, 


wears hs 
Le., if o(y) os 6(x). Therefore the integral equation (3.8) has a unique 
continuous solution on [a, b], and the proof is complete. 


Remark /, The sequence (3.10) converges to the same limit function ¢(a) 
on [a, 6], no matter how we choose the first function Qy(x) of the sequence, 


2 Note that such a solution is automatically bounded on [a, 4]. 
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provided only that g(x) is continuous and has a graph lying entirely in G 
In fact, as just shown, every such sequence converges to a continuous solution 
of (3.8), and this solution Is unique. 

Remark 2. All the remarks made at the end of Sec. 11 remain in force here. 


Remark 3. We can often get rid of the condition K(b — a) <1. For 
example, suppose the domain G contains a stripe q@x< d, —a<y<o 
(where c and d are finite), and suppose f(x, y) satisfies a Lipschitz condition 

Lf (x, Ve) =f oF yy) < K ae pay il, 

5 re ee ees : Me 

with the same constant K for all (x,y), (x, ¥s) in the strip." Then, if @o(x) is 
, : : ) aeaaoenee a 
continuous on [c,d], the successive approximations (3.10), theniselves all 
defined and continuous on [e, d], converge uniformly on [c, d] to a solution of 
(3.8). In fact, let 
M, = max |@,(x)|. My = max |o(x)|, MM =M)+ M, 


cir esrted 
Then, as in (3.12), 


lp(x) — (x), = 


(a {f{E, o1(2)] — SLE, eo(8)] dé | 
IF te — eo@1 a8 | < K 


= |x — Xo 


K 


[* M dé 
<9 


AN 


MK, 


| pi f= ¥ A - | |x a Sal 2 
leas) — e203) < K | le) — ex dey < ~SAP MK* 
and in general | 
|x ~— Xo|" rn | 2 13) 
lSne(t) — $,(x)| < ——_—— MB (cq x < d). (3. 
ie 
But the series 
9 pipe pe eS Ril a eek Sit OS ie Sol seo “fe 
M + M + —— MK + —— MK : ————— MK 
2 n! 


converges for all jx — xy]. Therefore the series (3.11) is uniformly convergent 
on [c, d] (why ?). Moreover, it follows from the relation 

Ax) = Gn(%) + [Omar) — Pn0)] + [Oneal X) — PneilX)] + °° 
and the estimate (3.13) that 
Espa? | nikal at 
(m2 -+ 1)! (mn -- 2)! 
This allows us to estimate the deviation of the mth approximation from the 
exact but still unknown solution. 


| . [ 
lolx) — g(x) < MK™ |x — xol™ mers 
ie 


** This ts compatible with {(%, y) being unbounded in the strip. 
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Remark 4. Sometimes a differential equation is specified in a domain 
together with some or all of its boundary. Then the situation remains the 
same if (Xq, Yo) 1s an interior point. However, if (xp, Yo) is a boundary point, 

existence of a solution passing through (xp, y,) is far from being guaranteed. 
However, the same proof as before shows that two distinct solutions satisfying 
the same initial data cannot exist. 


Problem I. Prove that if f(x, y) is continuously differentiable up to order k, 
then @y.10¥), Py.a(%),... are continuously differentiable up to order & + 1 and 
the sequence of jth derivatives of 9,(x), fj =1,...,4 +1 converges uniformly 
to the jth derivative of g(x) on every finite interval where all the 9,,(:) exist [and 
all the graphs y = 9,,() pass through G’J. 


Problem 2. Show that if 


lo(x) — a(x)] < clx — x," (agx<ba<xy<b,c > 0,d > 0), 
then 
K K> 
=~ Po Ssh eG Rf, Se as, aa es 
io(x) — 9, (x)| < ¢ lx — xo 3 d@tidrD” | 


provided only that the graph of any function (x) satisfying the — inequality 
passes through G’. This estimate allows us to go from the ( + 1)th approxima- 
tion to the exact solution if the (7 + [)th approximation differs only slightly 
from the ath, 


Problem 3. Show that the solution of the equation 4’ = x* + y* on the 
interval O < x < 1, subject to the condition y(0) = 0, satisfies the inequality 


ly = (ix° > gar) < 0.001528. 


Hint. Apply the result of the preceding problem, choosing (x) = $x° and 
studying the given differential equation in the closed domain 0 <x <1, 
lyi < N for suitably chosen NV > 0. 

Problem 4, One way of introducing the concept of a generalized solution of 
the differential equation y’ = /f (x, y) in the case of discontinuous /(x, y) is to gO 
over to the integral equation (3.8). Suppose, that f(x, ¥) is continuous in both 


xand y everywhere | in G, except possibly at a finite number of “‘singular’’ values 
of x. Suppose further that f(x, ) is bounded and satisfies the inequality 


if (x, ya) = x, ¥y)I gy v(x) by aa Vi 
at the “nonsingular” values of x, where ¥(x) is continuous for such x and 
Mo 
| uix)dx < «0 
eG 


(here (a, b) is any finite interval]. Then define the generalized solution of y" = 
f(x, ¥) satisfying the initial condition y(x9) = Yo to be the continuous solution 
of the integral equation (3.8). Prove the existence and uniqueness of this soluuon. 


f Such a solution does exist, for example, if the domain isa strip xX) GX < b. 
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Comment. The set of singular values of x may be infinite, but then extra 
precauti ons are necessary (this is the “generalized solution in the sense of 
Carathéodory”). 


approximations to the solution of y’ = 3° on the interval 0 <x < 2, satisfying 

the initial condition (0) = 1. What happens if the equation ts replaced by 
a 

yo =yorbyy = -¥? 


15. The Principle of Contraction Mappings 


The method of successive approximations, just used to prove the existence 
and uniqueness of solutions of differential equations, has applications to 
many other problems of analysis. This makes it important to find the most 
general conditions under which the method is applicable. Then instead of 
setting up the whole method in cach special case, we need merely verify that 
the conditions for applying the method are actually met. 

THEOREM | (Principle of contraction mappings). Let PD be a nonempty 
family of functions o(x), each defined on the same set E, satisfying the 
following conditions: 


1) Every function © is bounded, i.e.,*® 
sup |o(x)| = Ms < oo 
rék 


(in general, the bound depends on ©); 
2) The limit of any uniformly convergent. sequence of functions in D 
also belongs to PD; 
3) There is an operator A defined on © carrying every function © in 
® into another function Ag in O; 
4) The inequality 
sup |A9(x) — Ad(x)] << m sup |e(x) — 600) 
rek rel 


holds for every pair of functions @, @ in D and for some fixed m in 
the interval 0 <. m <1. 
Then the equation 


o = Ae (3.14) 


has one and only one solution belonging to the family ©. 


See ¢.g., G. Sansone, Equazioni Differenziali nel Campo Reale, second edition, N. 
Zanichelli Editore, Bologna (1949), Vol. 2, Chap. 8, Sec. 8. 

The symbol © means “belongs to” or “is an element of.’ By sup is meant the least 
upper bound with respect to all points x € E, oa 
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Proof. Choosing any function % in the family , we form the 
function 
0, = A%, 


which we call the “first approximation’’ to the desired solution of 
equation (3.14), The function 2, belongs to ®, because of condition 3, 
and hence we can construct a “second approximation” by setting 


Oe =r At, 


where the function %» also belongs to . Continuing this construction 
indefinitely, we obtain a sequence of functions 


Por Pls ares Duseees (3.15) 


where 0, = Ao,_, for every & > 1. 
We now show that the sequence (3.15) is uniformly convergent on 
the set &. As in Sec. 14, consider the series 


09-4 (By Oy) + (03 —— O)) (3.16) 
If sup |p| = 47, and sup |o,| = AY, (recall condition 1), then 
sup [9, — Gol < My + M, = M. 
Applying condition 4, we have 
SUP [pn1 — O,| = sup |do, — AQ 4] < mi sup lo, — O11. 
Therefore 
sup |9o] -+ sup lo, — eof - sup [ga — gil + sup 195 — al + °° 
< M + M + Mim + Mim + -°-, 


and hence the sequence (3.15), whose terms are the partial sums of the 
series (3.16), converges uniformly on £ to some function 9. By condition 
2, this limit function © belongs to the family ®, and hence it makes 
sense to write A@. Then, by condition 4+, we have 


sup |do — Ag,_| < msup lo — 9-11. 


It follows that de,_, converges uniformly to Ag as k — 9, since 
O — 0, _, converges uniformly to zero as k —» co. Therefore we can pass 
to the limit as & — 0 in the relation 


Qe = AQ 1, 
obtaining 
Oe AO; 


; ’ 


Which agrees with (3.14). 
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To prove the uniqueness of 9, suppose (3.14) has another solution ¢ 
vt 


in the family (. Then 


sup leo ast o| = Sup lo == As| NN sup Ke — |, 


which ts possible only if sup | — 9| = 0 or @ = 9, since m <1, 


Therefore 9 is unique, as Seited 


We now give some examples illustrating the applications of the principle 
of contraction mappings. 


Exaniple 1, First we mak how the principle of contraction mappings 

ields the méthod of successive approximations. With [a, 6] and G’ the same 
as in the theorem of Sec. 14 ne E and ® be the interval [@, 6] and the family 
of all continuous functions 2(x) defined on [a, 5] whose graphs lie in G’. Then 
conditions | and 2 of Theorem | are obviously satisfied. Moreover, let A be 
the operator carrying ¢@ into 


Ao=yot|" SE, 9(8)] dé. 
“Zo 


As shown in Sec. 14, this operator satisfies conditions 3 and 4 if the interval 
[a, b] is sufficiently small. Therefore Theorem | implies the existence of a 
unique continuous solution of the integral equation (3.8), or equivalently the 
existence of a unique solution y(x) of the differential equation y’ = f(x, y) 
satisfying the initial condition \(%»)) = Vo. 


Example 2. Consider the integral equation 
(x) = f(x) + x K(x, Z)a(%) dé, (3.17) 


where 7 is a constant, f(x) is continuous for a < x < b, and the “kernel” 
K(x, 6) is conunuous for a.< x < boa < ~€< 6. Choosing E and ® to be 
the closed interval [a, 5] sai the fainil ly of all continuous functions (x) on 
[a,b]. we immediately verify that conditions 1 and 2 of Theorem | are 
satisfied. Let A be the operator carrying 9 into 


fy es, wm - 
Ag = f(x) +4] K(x, 22) dé. 
d 
This operator clearly satisfies condition 3. Moreover, 
~ - Be , aad ol ll dh oad hall 
Ae — 43] = |2]" K(x, DLE) — BE) a 
“a 


< [al M(b — a) max |o(x) — o(x)I, 
arb 


where 
M = max [K(x, 2)I, 


rae eu 
ae a ae 
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and hence condition 4 1s satisfied if (6 — a) |2| Wf < 1. Therefore Theorem | 
implies the existence of a unique continuous solution o(x) of the Integral 
equation (3.17) ona < x < b. 


Example 3. Suppose f(x) is defined for all real x and satisfies a Lipschitz 
condition with constant K <1. Then it follows from the principle of con- 
traction mappings that the equation x = f(x) has a unique solution. In fact, 
let the set £ consist of a single point xg, and let (b = {9} be the family of all 
finite functions defined on £. Then {o(x,)} is just the set of all real numbers, 
(note that each @ takes only one value). Clearly conditions 1 and 2 of 
Theorem | are satisfied (trivially). Moreover, let A be the operator carrying 
o into the function whose value at x, is just f [9(x,)]. Since fis defined for all 
real x, condition 3 ts satisfied. To verify condition 4, we merely note that 


FS 3 eat (C2) ae Gr eee: a © Gee 
Finally we use the principle of contraction mappings to prove 


THEOREM 2 (/miplicit function theorem). Let f(x, ¥) be defined on the 
sip aax <b, —2e <y < w, and suppose f(x, ¥) is continuous in x 
and differentiable in y, where 


<mefasy<M< 0 
everywhere tn the strip. Then the equation 
f(x,y) =0 (3.18) 
has one and only one continuous solution \(Xx) on ac x < b. 
Proof. If £ is the closed interval [a, 0] and @ the family of all 


continuous functions on [a, b], then conditions 1 and 2 of Theorem 1 
are obviously satisfied. Now let A be the operator carrying 9 into 
| l 
Ag=o——f(x, 9). 
M 
This operator obviously satisfies condition 3, and moreover condition 
4 as well, since 


[do — Aol = 


| eee. 2 
a ss 9) one oO — — f XxX, o | | 
& rr ?, E M: ( ) 


(9-9) — 


Af 
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It follows from Theorem | that the equation 


ior 
eae eave 9). 


or equivalently the equation (3.18), has a unique continuous solution, 


* 
r * 
as asserted. 


Problem 1, Prove that Theorem I remains valid if condition 1 is replaced by 
the weaker condition that the differences between any two functions in ® be 
bounded. 


Problent 2, Give examples showing that each of the four conditions figuring 
in Theorem 1 is essential. Also verify that m7 = | is not allowed. 


16. Geometric Interpretation of the Principle of 
Contraction Mappings 


Suppose we think of the family ® figuring in the principle of contraction 
mappings as a set of “points,” with the quantity sup jo — %| as the ‘“‘distance” 
between two points 9 and ¢ in ®, With this interpretation, condition 2 says 
that every “limit point” of an infinite sequence of points of the set @ belongs 
to ®, i.e., that the set @ is closed. Condition 3 means that the operator A 
carries every point of ® into another point of ®, while condition 4 means that 
if A carries the points 9 and ¢ into new points o* and 5*, then the distance 
between o* and o* cannot exceed m times the distance between © and ©, 
where 0 < m <1. Finally, finding the solution of equation (3.14) in the 
class of functions ‘() = {9} means finding a point of the set which remains 
fixed under the action of the operator A. 

The fact that such a fixed point exists is geometrically obvious, if it is 
assumed that the set © is bounded. Then the distances between pairs of points 
of ® have a least upper bound d, called the “diameter” of the set ®. Suppose 
we think of the set ® as a closed domain, bounded by 
a curve / (see Figure 11). Then, letting A act upon all 
the points of ®, we obtain a new set @,, which is con- 
tained in @, by condition 3. Moreover, by condition 
4, the diameter of @, is no greater than md. In Figure 
Il, the set , is the region bounded by the curve /). 
Applying the operator A to the points of @,, we obtain 
anew set D,. Since A carries the point of @ into points 

Figure 1] of ,, it certainly carries the points of @, (a subset of 
®) into points of ®,, i.e., @, is contained in @,. By 
condition 4, the diameter of , cannot exceed md. In Figure 11, the set (D, 


Sal 


is the region bounded by the curve /,. 
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Continuing this process indefinitely, we obtain an infinite sequence of 
“nested”? closed sets P, (P,, D,, ..., D,, ..., where the diameter of @, 
approaches zero as n—» co, Therefore the intersection of all these sets 
consists of a single point, which is obviously fixed (invariant) under the 
action of the operator A. There cannot be two distinct fixed points, since 
then the distance between the two points would be positive and unaffected 
by A, contrary to condition 4. 


Problem I. Let F be a closed bounded set in a-dimensional space, and sup- 
pose / is mapped into itself by an operator A such that 


o(Ap, Aq) < <(p,q), (3.19) 


where ¢(p, q) denotes the distance between the points p and q. Prove that pre- 
cisely one point remains fixed. Is this true for sets which are bounded but not 
closed? How about closed unbounded sets? 


Problem 2, Prove that if the inequality (3.19) is replaced by 
o(Ap, Ag) < me(p, 9) (O<m <1), 


then there is a unique fixed point for arbitrary closed sets (even if unbounded). 
Problem 3. Suppose the inequality (3.19) is replaced by 
e(p, lq) < <(p, 9). (3.20) 


Prove that there is a unique fixed point if F is a line segment or the boundary 
of an isosceles right triangle, but not if Fis a circle or if Fis the boundary of an 
isosceles right triangle and ¢ denotes the shortest distance measured along the 
boundary. Show that there is a unique fixed point if (3.20) holds and if Fis any 
closed bounded canvex set (the proof requires the use of a preliminary similarity 
transformation). Give an example of a closed bounded set and a mapping of this 
Sct into Itself satistying (3.20) which has no fixed points and does not reduce to a 
motion. Can the set be a circle? 


Problem 4. Show that tf o(x) is a continuous function on f[a, &] such that 
a < o(x) <b, then there is a point xy in [@, 6) such that ¢(xp) = xp. In other 
words, show that a continuous transformation of a closed interval into itself 
has at least one fixed point. 


Comment, The analogous theorem is valid for an a-dimensional ball (see 
See. 58), 

Problem 5 (OQ. A. Oleinik)., Let f(x,y) be a continuous function defined for 
all real x and y, such that 

a) /(x, y) satisfies a Lipschitz condition in y; 

b) ff + Tis) = fx, y) for some T > 0; 

c) fw eD/G, vy) <0 for all x and some yy, ¥9. 
Using Prob. 4, prove that the equation py = f(x, ) has at least one periodic 
solution, with period 7. Apply this result to the equation 3” = a(x)y + o(x), 
Where a(x) 52 0 and &(x) are continuous periodic functions, with period 7. 


é 
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17. Cauchy's Theorem 


A -Ailiclionid iewe<5 X,) is said to be analytic in. all its arguments ing 
neighborhood 


u pay ee im # 
IX; ice x, mT k aa ? coe gy fll (3.21) 
of the point (xf, ..., 4) if it has a power series expansion in the variables 


x, — x8,....X, — ¥? in the neighborhood (3.21), Here we allow both the 
function and its arguments to take complex values. By the derivative of an 
analytic function F(x,,...,¥,) with respect to any argument x, (in general 
comple), we mean the function whose power series expansion is obtained by 
term-by-term differentiation of the power series of F(x,,..., X,). As is well 
known, the differentiated series continues to converge in the neighborhood 
(3.21).!" Throughout this section, it does not matter whether the quantities 
under consideration are real or complex, since the rules for differentiating 
sums, products, composite functions, etc., are the same in both cases. 


THEOREM (Cauchy's theorem). Jf f(x, v) is analytic in both.x and 
y ina neighborhood of the point (Xq. 9), then the differential equation 


A oe (3.22) 


ax 


has a unique solution (x) satisfying the initial condition 


30%) = Yo (3.23) 


and this solution is analytic in a neighborhood of Xo. 


Proof. There is no loss of generality in assuming that xy = Yp = 0, 
since otherwise we need only transform to new variables x* = x — X, 
y* = y — yo. Suppose (3.22) has an analytic solution equal to zero at 
x = 0, with power series 


y(x) = Cy + Cyx + Con? + Cyr? + eee. (3.24) 


Then obviously C, = (0) = 0. Substituting (3.24) into (3.22), repeatedly 
differentiating the result with respect to x, and then setting x = y =0 
in each of the equations so obtained, we find the following (infinite) 


? See eg, T. M, Apostol, Advanced Calculus, Addison-Wesley Publishing Co., Inc., 
Reading, Mass. (1957), Secs. 12-15 and 13-15, ) 

_* From an historical standpoint, Cauchy's theorem is the first proof of existence and 
uniqueness of a solution of (3.22), subject to the initial condition (3.23), where f(x, J) so 
a rather general form. However, the requirement that f(x, y) be analytic is often artificial, 
and in fact f(x, y) fails to be analytic in many problems stemming from the applications. 
Thus a more general theory, like that of Secs. 11, 12 and 14, is indispensable. 
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system of equations satisfied by the coefficients C,: 
Cy = y'(0) = f(0, 0), 
2C, = y¥"(0) = f.(0, 0) + f,(9, 0) y"(0) 
= f(0, 0) + f,(0, Cy, 
3-20, = y"(0) = fre(0, 0) + 2f,,(0, 0)9"(0) + f,,(0, 0)y"°(0) 
+ f,(0, O)y"(0), 
= f,(0,0) + 2f,,(0,0)C, + f,,(0, Cr + f,(0, 0)2C., 
(3.25) 
The coefficients C,, are uniquely determined by (3.25), and hence there 
can be no more than one solution of (3.22) satisfying the condition (3.23). 
It is important to note that each C, can be expressed in terms of sums 
and products of the coefficients C,,...,C,_,; (with /ower indices) 
and the derivatives of f(x, y) at the origin, where the latter are coefh- 
cients of the power series expansion 


IS aa (3.26) 
Pe 


(assumed to converge in some neighborhood [x] <r, |y] <r). 

From now on, let (3.24) denote the series whose coefficients are 
determined from the system (3.25). Then the proof will be complete, 
once we succeed in showing that (3.24) converges in some neighborhood 
of the point x = 0, y = 0. In fact, substituting (3.24) into (3.22) and 
expanding the right-hand side in a (double) power series, we find that 
identical powers of x have the same coeificients in both sides of the 
resulting equation.?° To establish this convergence, we shall consider the 
auxiliary equation 


— = F(x, 2), (o27)} 
Where f(x, 2) is analytic in a neighborhood of the origin and has a power 


series expansion 


ss 


Pye > ways 
ki=0 
whose coefficients are related to those of (3.26) by the inequalities 
[a;..| Sy |Az..| (x, l= 0, L 2 o + ye 


The equation (3.27) is said to majerize equation (3.22), and similarly the 

function F(x, 2) is said to majorize (or to be a majorant) of the function 

' Here we use the theorem on substitution of one power series Into another, See 
eg, T.M. Apostol, op. cft., p. 444. 
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f(x, 2). f(x, ¥) 1s analytic in the neighborhood |x| <r, [yl <r, then 
f(x. x) has a majorant of the form 


M 


. (xl<ry lle, 


F(x, 2) = 


¢ x pa =) ; bs 
(1 = S| (1 oa =| 
; ee ae 
where 0 <r’ < rand Af is a suitable positive constant. 
Thus suppose we have found an analytic solution 
! be oh hoo #3 
a(x) = Cg C8 a Ganka CaN ae es (3.28) 


of the equation 


satisfying the initial condition 2(0) = 0 [so that Cy = 0]. Then the 
series (3.24) will converge whenever (3.28) does. In fact, each CF isan 
expression involving sums and products of the coefficients C¥,..., CE, 
and the derivatives of F(x, y) at the origin, and this expression is the 
same as that for C, except for the asterisks and the presence of deriv- 
atives of F(x, 1) instead of derivatives of f(x, y). But F(x, y) majorizes 


*" In the case of one variable x, suppose the series 


oD 
JX) = > agx* 
k=0 


converges for ail x in (0, r). Choosing r’ in (0, r), we form the convergent series 


iw #7 
Ty { 7K 
> 1az| 0%, 
rizO 


with sum Af. Then 
laz| r’* <M, 
and hence 


iv 
la,| i eae Ay. 


p's 
The function 
wy mo Af 
i 
a 5. : ae ear . & 
Fix) = > ALN == > SE 
i--0 f= Q 


Is Obviously a majorant of f(x) in (0, r’). But clearly 


EG: — as M 

oS) Mia] Se: 
es r aS 

ect 4 é ] a 


- 
a 


The generalization to two (or more) variables is immediate. 
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S(x,y), and hence? 
Cr s Cr (& adil I, 2; as y 


so that the series (3.24) for (x) converges by the comparison test. To 
find z(x), we need only consider real values of x (why ?). For real x, we 
can solve 


by separation of variables, obtaining 

val 3 ee ae: er 
| ( eS ae gene 
o\ y 19 


z 
i-= 
OF 


z F x 
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which vanishes for x = 0 and is an analytic function of x for sufficiently 
small |x|, 1.e., for 
n= r i pled!) 


(cf. footnote 19, p. fn ). Therefore the formal solution (3.24) converges 
for sufficiently small |x| to an analytic solution of the differential 


equation (3.22) ra the initial condition (3.23). The proof of 
Cauchy’s theorem is now complete. 

COROLLARY. [f f(x, ¥) is analytic on a domain G,* and if f(x, ¥) is real 
for all real x and y, then every real solution of the differential equation 
Woes f(x,y) whose graph lies in G is analytic. 


Proof. Wf f(x, ¥) is analytic on G, then every point (X%9, Yo) of G has 
a neighborhood ~f in which f(x, y) satisfies a Lipschitz condition in y. 


°" For example, according to (3.25), 
IC] = [£(0, 0)| < FO, 0) = 
[Cal <= 21 fC, O] + FLO. 0 [Ci] < $F. 0) + 4,00, OCP = CH, 


and so on, 
“Le, analyuic ina neighborhood of every point of G. 
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But then, by Osgood’s theorem, there can be no more than one solution 


inf equal to vp for X= No. and this solution must coincide with the 
analytic solution constructed above. 
Remark. If the function /(, y) appearing in (3.22) is linear in y, we can 
find better estimates for the domain of existence of solutions of (3.22) than 
in the general case. In fact. consider the equation 


dy Mi ' . ( 
— <= a(x)\ a DES», (3.29) 
aX | | 


where a(x) and h(x) are analytic functions of x for |x| <r. Then (3.29) is 
majorized by the equation 


dz Mo. 


= —— (z + 1) (O92 8); (3.30) 
dx x 
Saag 
; 


where the constant .W is a common majorant of both a(x) and A(x). But for 
Ix] <r. (3.30) has the analytic soluuon 


a 
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which vanishes for x = 0. Therefore, by the same argument as in the general 
case, the linear equation (3.29) has an analytic solution for |x| <r’, equal to 
zero for x = 0. 


Problem. Find the first four terms in the expansion of the sciution of the 
equation y” = e*” (}(0) = 0] in powers of x. Find upper and lower bounds for 
the remainder term and for the radius of convergence. 


18. Smoothness of Solutions 


TueoreM. Jf f(x, ¥) has continuous derivatives with respect to x and 
y up to order p = O (inclusive), then every solution of the differential 
equation 


’ x 4 ” 
+ = f(x, y) (3.31) 
has continuous derivatives with respect to x up to order p ~~ I. 
Proof. \f y(x) is a solution of (3.31), then 


vx) = fi. OO), (3.32) 


“It p= 0, we regard ftv, ) as continuous, 
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where in writing (3.32), we presuppose that }(x) is differentiable and 
hence continuous, Therefore if f(x, }') is continuous in x and y, the right- 
hand side of (3.32) is a continuous function of x, and hence so is y’(x). 
Similarly, if p > 1 the right-hand side of (3.32) has a continuous deriv- 
ative with respect to x, and hence so does y’(x), 1.¢., v(x) has continuous 
derivatives up to order 2. Differentiating (3.32) with respect to x, we 
obtain 


yx) = LI, PON) + AD, yO) PG). (3.33) 


Assuming that p > 2 and applying the same argument to (3.33) as just 
applied to (3.32), we find that jx) has continuous derivatives up to 
order 3, and so on. 


Problem, Give an example of a differential equation (3.31) where f(x, y) is 
continuous but not differentiable, all of whose solutions are analytic. 


19. Dependence of the Solution of y’ = f(x, y) 
on the Initial Data and f(x, y) 


Until now, we have considered the problem of finding the solution of a 
given differential equation »" = /(x, )’) passing through a given point (xy, Yo). 
In general, of course, changing the point (Xo, ¥o) will change the solution, 
and just flow the solution changes is a question of great importance in the 
applications. As pointed out by Hadamard, this question is also fundamental 
to the whole subject. In fact, suppose the study of some physical problem 
reduces to finding the solution of a certain differential equation satisfying 
certain inital conditions, Then these initial conditions are usually found by 
making experimental measurements. But the ultimate accuracy of such 
measurements is limited. Therefore a solution y(x) determined by some 
initial condition y(vy) = jy would be of very little interest if slight inaccuracies 
in the measurement of yy (and xy) could greatly change +(x). 

Besides these considerations, it should be pointed out that using a 
diffcrential equation to study an actual physical process always entails certain 
idealizations, made with the intention of isolating the most significant 
features of the problem under discussion. For example, the molecular 
structure of matter is not taken into account In Example 2, p. 4. Therefore 
it is in any event only an approximation to describe an actual physical 
process by a given differential equation and boundary condition, and we 
would be equally justified in trying to satisfy all differential equations and 
initial conditions “sufliciently close’ to these actually selected, provided of 
course that these differences do not exceed those allowed by the given 
idealization of the actual physical process. Thus, in Example 2. p. 4, we 
could just as well satisfy any other initial condition differing from the given 
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‘nitial condition by less than the mass of a single atom of radium, Therefore 
the solutions of all “neighboring” differential equations satisfying “neighbor- 
ine” initial conditions should be ‘close together,” and in fact i ndistinguishable 
within the context of the given idealization. Otherwise, using the piven 
differential equation and initial condition to study the physical process in 
question might not be legitimate. 

Having made these observations, we now show that under certain 
conditions the solution of a differential equation depends continuously on 
the equation itself and on the initial conditions. 


THrorem.? Let f(x, ¥) be continuous and bounded on a domain G, 
and suppose there is a unique sohition of the differential eauation 


rd =x y) (3.34) 


passing through each point (Xo; ¥o) of G. Then the solution of (3.34) 

depends continuously on the function f(x,y) and on the point (Xo, Yo) in 

the following sense: Suppose the solution yy(x) of (3.34) passing through 

(Xge¥'o) is continued onto some interval [a,b], where a <x) <8. Let 

x*®. y*) be any point in G and f*(x, v) any function continuous on G. 
az Vo aa Ste snake | 

Then, given any = > 0, there isa 6 = 9(s) > O such that the inequalities™ 
* : * : 2 oe Aly 
sls, lip el <4, sup |f*(x, y) —S(x, <8 

(>, yleG 


imply that the equation 


y = f*(x, y) (3.35) 
has @ solution y*(x) passing through (xq. vg) and defined on [a, b] which 
satisfies the inequality 
sup {y'9(x) — Vo(x)| < «. 
acreb 

Proof. Let(x,, ,) be an arbitrary sequence of points in G approach- 
ing (Xg. No) as kK — oo, and let {f,(x, y)} be an arbitrary sequence of 
functions continuous on G such that 

lim sup [f,.(Cx, 3) — f(x, y)| = 0. 


koa (xr y)elr 
Moreover, let (x) be the solution of the differential equation 
¥y = f(x. y) a eee 
sauisfying the initial condition y(x,) = y,. Then we must show that” 


lim sup [yx(x) — yu(x)l = 0. (3.36) 


[toe 2g « i 
- So. A POL 


** A related but independent result on the continuous dependence of the solution ot 
(3.34) on the initial data will be proved in Sec. 21 (under stronger assumptions). 

* Concerning the notation, see footnote 16, p. 44. 

** This presupposes that yx.) can be continued onto the whole interval [a, 6], at least 
for sufficiently large &. This fact will emerge in the course of the proof. 
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To this end, let 4f be any number exceeding 


sup |f(x. y)[, 


(xr,vlety 
and construct a rectangle of the form 
|x — Nol < ¢, ly — vol < Me 
lying entirely in G. Then yo(x) can be defined on the whole interval 
Ix — Xl << ¢ and so can »,(x) for large enough & (why ?). Moreover 
lim sup. [ye(x) — yo(x)] = 0, (3.37) 


k-oos | r—so] se 


/\ 


since otherwise we could find an infinite sequence of indices k, < k 
and an « > 0 such that 


sup [¥z,(x) — yo(x)1 > 


[ux] he 


12° 
F 


(9 


(3.38) 


for all j= 1,2,... But the sequence {},(x)} is uniformly bounded on 
lx — xp] < ¢ (why?), and moreover equicontinuous on [x — x5] < ¢ 
since for all sufficiently large kK we have 


Lvs) = GG, vl <a M 


ee) SOs a ie |, 


Therefore, by Arzela’s theorem (see Sec. 10), the sequence {y', (x)} 
contains a uniformly convergent subsequence ¥,(x), ¥a(x),..., and by 
an argument like that given in the proof of Peano’s theorem (see Sec. 11), 
this subsequence converges to a solution of equation (3.34) passing 
through (Xp, Vo), 1.e., to the solution yo(x) since the latter is unique. But 
uniform convergence of ¥,(+), Pa(x), ... to p(x) on the interval [x — xp] < ¢ 
contradicts (3,38), thereby establishing (3.37) and proving the theorem 
if the interval [a, 5] figuring in the statement of the theorem is contained 
in |x ~— Nl < c 

If [a,b] is not contained in |x — x,| < c, we construct another 
rectangle 


and hence 


Ix — see ees a oe ly on pa ag Me, 


lying entirely in G, where 


(1) a ar OY ase ah PRS ; ; 
No = Ng mn ¢, vo = ¥o(Xo ). (3:37) 


As just shown, the point (x9, y,(407)) can be made arbitrarily close to 

(x(), yi?) for sufficiently large k. Therefore, applying the same argument 

to the solutions po(x), v(x) and the points (xO). 46¥), (xp? tin 

the interval [x — xf] < c¢, as just applied to the solutions yy(x). ¥,.(x) 

and the points (Xp, vy), (X,, Fe) In the interval |v — xl < ce, we find that 
lim sup v(x) — Vo(x)| = 0. 


keen [omrg! [Sey 
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> 


Therefore, taking account of (3.37), we have 


im sup Iya) — Fol) = 9, 
ee 


R-wom fact? By apyt cFey 


if the ee sign was chosen in (3.39). If [@, 4] is contained in [xy — ¢, 
. i @ ++ Gy], the proof is complete. Otherwise we construct further 


AL a 


rectangles 


x—x™ cq, ly —yel< Me 
lying entirely in G, where 
<a! = ere a (nas y a oe: eae ? 
until finally, using only a finite number of rectangles (why ts this possible 


and crucial?), we succeed in enlarging the original interval [Xo — ¢, 
Xp + ¢] to an interval [x, 8] containing [a, 6] such that 


a ee iy.(x) — ¥o(x)| = 0. 


2S 
This implies (3.36) and completes the proof. 


Remark. Let f(x.) be continuous and bounded on a domain G, and 
suppose there is a unique solution Vo(x) of the differential equation y' = 
f(x,y) passing through some point (%,.¥9) Of G. Then every solution of 
(3. 3) passing through the point (x%, y$) converges uniformly to yo(x) as 
ae oe sup [f* — f|—0. In fact, this is essentially all’ that was 
used to prove the theorem. 


Besides proving that the solution depends continuously on the initial data, 
it is sometimes important to prove the existence of derivatives of the solution 
with respect to the initial data. Let 3(x, Xo, Ye) denote the solution of the 
differential equation (3.34) satisfying the initial condition y = }'9 for ¥ = %p. 
Suppose we introduce a new function 


-_ = ee Xo, vo) eG 
and a new independent variable 
f=xX— Xp, 
so that 7 = 0, = = O corresponds to x = Ny, ¥ = Yo. Then 
== Ul + Xq, Xo. Fu) — Vo 


and the differential equation (3.34) takes the form 


dz : an 
f= f(t + xq 2 = yu) (40) 
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Therefore studying the dependence of the solution of (3.34) on the initial 
data reduces to studying the dependence of the solution of (3.40) on certain 
parameters appearing in its right-hand side. This problem was investigated 
by Poincare, and will be solved in Sec. 21 by using Hadamard’s Jemma 
(proved in Sec. 20). 


Problem 1, Prove that ifinstead of introducing the differential equation (3.34) 
in the statement of the theorem on p. 56, we simply assume that the function 
y*(x) satisfies the inequality 


ly = fCx, vy] <4, 


then y#(x) can be regarded as piecewise smooth. Also prove that the theorem 
of Sec. 13 becomes a corollary of the present theorem, if generalized in this way. 


Problem 2, Suppose y(x), (x) and u(x), @ < xy < bare continuous functions 
satisfying the relations 


ee 
Wx) = Ny +] fis vlads, 
i$) 


<' 


; a 
z(x) > Yor |” na [s, 2(s)] ds, 
oe “0 


u(x) > ¥y + | 7 SIs, u(s)] ds, 
Ce 

where f(x, ) is nondecreasing in y and continuous in a domain containing the 
graphs of all three functions. Prove that z(x) > v(x) for xg < x < 6, and more- 
over, if the solution of (3.34) satisfying the initial condition +(x») = Vy is unique, 
then w(x) > vx) for xy <x < b. [ff there ts no uniqueness, the right-hand side 
becomes the minimum solution (as defined in Prob, 3, Sec. 11) of equation (3.34) 
sausfying the imual condition ¥(v9) = yy.] Extend this result to the case of the 
opposite inequalities. Is it essential that f(x, ¥) be nondecreasing in y? 


Problem 3. Prove the following result, known as Gromnwvall’s lemma: If a 
continuous function ¥(x), xy <x < & satisfies the inequality 


* 


MX) Sef + |" B(s)y(s) ds, 


» 1) 


where the function B(x) ~ 0 is continuous, then 


VOX) = A exp is B(x) ds 


| beet bs 
| (x i Sony “A or b ) = 
v My : 


Extend this result to the case where B(x) is discontinuous or even unbounded. 
Under appropriate assumptions, obtain a similar estimate for a function y(v) 
sausfying the inequality 


y(x) ie ACX) — | B(s) v(s) dy, 


Use Gronwall’s lemma to deduce sufficient conditions for uniqueness of the 
soluuion of the equation 1" = fx, 1). 
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Problem 4. Find an estimate for a continuous function y(x) satisfying the 
inequality = 
yx) <A + [ B(s)e{s(s)] ds (Xp < x), 

~ <9 


where fis a constant, ¢(x) > 0 is continuous, B(x) > 0 and 


of the equation y" = f(x, »). Compare the result with Osgood’s theorem. 
Problem 3. Using the result of Prob. 3, find an explicit estimate for the 
deviation of yf(x) from yo(x) in the theorem on p. 56, if f(x, y) satisties a 
Lipschitz condition in y. Use this result in turn to deduce an explicit estimate 
for the deviation of an Euler line whose segments have a given maximum length 
from the exact solution, if f(x, 1) satisfies a Lipschitz condition in both yariables, 


Problem 6. Prove that the continuous dependence proved in the theorem on 
p. 56 is uniform in xf and yg if the point (vf, y#) belongs to a sufficiently small 
neighborhood of the curve 


VSHyl) (a cx < d). 
Problem 7, Consider the equation 
oe = (x _ h), (3.41) 


where /ris a constant. If > 0, (3.41) is called a differential equation with retarded 
argument. We are interested in solutions of (3.41) defined for all real x, It is easy 
to see that Knowledge of such a solution on any interval of length / determine 
the solution everywhere. Prove that if # > 0, then given any 4 > Oande >0, 
there exists ad > O such that |y(x)| <= forO <x <A if [yGy)| <dfor -A< 
x <0. However, prove that if # <0, then given any A > 0, there exists a 
sequence of solutions y,,(x) uniformly converging to zero for —«@ <x <0 but 
such that 
lim sup |y(x)} = 2. 


moe Orca 
Hint. To prove the last assertion, consider solutions of the form 
y = e*7 sin Bx, 
where « and @ are suitably chosen constants. 


Problem 8. Given a family of continuous functions defined on an interval 
(a, 5), let A be an arbitrary point of the strip a <x <b, —% <y<™ 
Suppose one and only one graph of a function f(x) of the family passes through 
the point 4. Prove that {,-(x) converges uniformly to f,(x) on [a, b] as Ad 
(here A’ is a variable point and A a fixed point of the strip). 

State an analogous result for a family of functions whose graphs fill a given 
domain. Roughly speaking, this means that continuous dependence on the 
initial data foliows from existence and uniqueness rather than from the fact-that 
the given functions are solutions of a differential equation. 
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20. Hadamard’s Lemma 


LEMMA (Hadamard). Let G be a domain in (xy, 0.64 Xo Z10 06 23 Zin 
space which is convex in X,...,Xy 7" and suppose F(x, 2.64 Xn 
Zaye 0+) 2) has continuous derivatives with respect to Xy,...,X, up to 
order 5 > QO (inclusive) on G. Then there are n functions 


ti 


D(x, | Xa Vi 2a S Vas =I) eee 2 “snd (i — l, se ey #1) 
with continuous derivatives with respect 10 Xy. 2.65 Xn: ¥in- ++) ¥n UP to 
order p — 1 (on G) such that 


FV, sees Vass “15 Bi oet wy Sp) FAR eee Mays 24; a eesy 2) 
(3.42) 
— Sb, (i: seey Xs M15 sts vn 245 eee ee Ler —_ X;). 
Teo] 


Proof.” We start from the obvious equality 


ue 


I (Vy, ee #4 Vy “19 +e * 4 ame — F(x, 7 f 2 4@ V3 “yy *“* *# 


a) 
. | | 
= |. PAN PAV Ric koa By PA Rs Evans eg Sal db 


valid because of the assumed convexity of G in x,,...,X,. Expressing 
F, in terms of derivatives of F with respect to the variables 


XY -4. (Oy oS xi). ene yg a an = (Cy, 1. X,), 


and denoting these derivatives by /y,..., £,, we have 


F Vigletstee Van 2 Gao ea Se es See 
7 
nt me (vy; — x;) (* Xe tp Se Ree sg 2p PIG OR a op ag 


~, 


Choosing the integrals in the right-hand side as the functions ®,, we see 
at once that they have all required properties. 


Problem 1, |s the representation of the function F guaranteed by Hadamard’s 
lemma unique? 


Problem 2. Prove that if the increment of the function F has the representa- 

ion (3,42) aeons the functions @; have continuous derivatives with respect to 
Neel Aa Vs+5 4 Sy UP tO order k ~ Oon G, then the function F(x,,..., %y5 
Zy,.+.,2,,) has continuous derivatives with respect to Xy,...,%, up to order 
A + lon G. This is to a certain extent the converse of Hadamard’s lemma. 
AP SOMME IN OG. a2. 4g Mar Oy deh) Spack, ne, the space of points (xj as ipl 
Tiree ee Zw) iS said to be convex in xy,...,X, if whenever two points (xy, ...5%0 
Tiveeee Sm) aNd (X},...,45.21,---, 2) belong to G, so does the whole line segment 
joining the two points. 

“" The derivative of order 0 will always mean the function itself. 

“* Due to M. A. Kreines. 
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21. Dependence of the Solution on Parameters 


THEOREM. Given a differential equation 


dy | 7 
ax , 


let G,, be a closed demain in tie xy-plane, let P be the open n-dimensional 
parallelepiped 


a 
7 ph 
iene hat 


& 


Gviere the u° are positive constants), and let G = G., x P20 Suppose 
PONY. ay. ty) tas bounded continuous derivatives with respect to 
Bettis... a, Up to order p > OQ (inclusive) on G*! and satisfies a Lipschitz 
condition 


| f(x. Vo, iA). see 8 uL) — f(x, ae they or ean r,.)| al K ie —e val 


in von G. Then, given any interior point (Xq. Yo) of G,,. there is a closed 
interval ax <b (a <Xy <b) on which the differential equation 
(3.43) fas a unique solution o(x. yy...) egual to Yo for x =X. 
Moreover, (xX, ty.....,) has continuous derivatives with respect to 
Ure... 2, up io order p on the set fa, b) x P. 


Proof. It is sufficient to consider the differential equation 


d aa f co 
—_ = Faes ¥, it), (3.44)- 
ax 
Whose right-hand side contains only one parameter, since the general 
cas¢ Is handled in exactly the same way. Using the method of successive 
approximations to find a solution of (3.44) equal to yo for x = Xp, we 
write 


= 


Pa eats e 
©,(X, U) = Vo | aa [<. (5); 12] tas 


G(X 2) = Yo +] SEE, o1(Z, uw). w) dé, 
* zg 


and so on, where ¢(x) is continuous on the closed interval [a, 5] figuring 
in the theorem of Sec. 14. It is apparent that every o,(x, w) is continuous 
in both x and » on the rectangular region Ria <x < 6, |u| < po in 
the xu-plane. Repeating the argument of Sec. 14, we find that the 
sequence {9,(x, u)} converges uniformly in x and p on & to a limt 


*’ By G., * P we mean the Cartesian product of Gz, and P, i.e., the set of all points 
Ce a 2.) such that (x, A hee Go and (t2),..., dn) EP. 

"| The case p = 0 has the same meaning as in footnote 28, p. 61. 

** Here K is independent of the variables x, ae 


' 
sy taege 
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function o(x, »), which is the unique solution of (3.44) passing through 
the point (Xo, Yo). Moreover, 9(x, p.) is clearly continuous in x and up 
on K. 
Now suppose p > |, and let 9(v, 2 -+- Ay) be the unique solution 
of the equation 
dy _ ; 
= f(x. yu + Ap) (3.45) 
dx 
satisfying the initial condition @(X,, p+ Aw) = yy. Substituting o(x, w) 
and (x, u + Av) into (3.44) and (3.45), we obtain the identity 


d 7a = f[[x, o(x,u + Au), uo + Av] — fLx, o(x, 2), 2], (3.46) 
(tN 


where Ae denotes the increment @(x, u -+- Av) — ¢(x, w). According to 
Hadamard’s lemma, the right-hand side of (3.46) 1s of the form 


Ao + dyAv, (3.47) 


where @, and ®, are continuous functions of x, o(x, 2), o(x, w + Ap), 
yp and uw -+ Au, and hence continuous functions of x and Aw (if we 


regard wu as fixed), since (x, 4) and o(x, u -+ Au) have already been 


shown to be continuous. Substituting (3.47) into (3.46) and dividing 
by Au, we find that Ag/Ap. satisfies the linear equation 


£(22) =, ae EAD, (3.48) 
dx \ Au. Au 


So far, the expression Ao/Ayw has only been defined for Au = 0. 
We now define Ao/Au for Ay = 0 by requiring that it satisfy (3.48) and 
vanish for x = xy. Regarding x, Au and Ao/Aw as separate variables, 
we sce that the right-hand side of (3.48) is continuous in the variables 
x and Ay (figuring in @, and @,) and has a bounded derivative with 
respect to Ag/Au. To verify the last assertion, we note that @, is the 
integral of certain values of éf/dy (see Sec. 20) and df /éy is bounded. 
Moreover, 


for every Au if x = xy. It follows from the first part of the proof that 
the solution Ag/Ay of equation (3.48) is continuous in Ap for all 
sufficiently small values of |Au|. Therefore Ag/Au approaches a definite 
limit as Au —» 0, and we have proved the existence of the derivative 


A(x. 11) 


Cu 
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Moreover, it follows from Hadamard’s lemma that 


es of ee. of 
lim @, os Itm D., — Gg 
Ay-*0 GO Apo Ow 


and hence @9/éy satisfies the differential equation 


2) of 09 . OO 3.49) 

ae de 3.4! 

=( do au Cu oe 
and the initial condition 

G9 | 

Cu z 


Seemed 
* 


Therefore, applying the first part of the theorem to (3.49), we find that 
éo/é@u is continuous in x and yw on the region R. 

If p > 2, we apply the same argument to equation (3.49) as just 
applied to equation (3.44), with dg/du now playing the role of ¢. In this 
way, we find that 079/éu? exists and is continuous in x and u on R. The 
theorem in its full generality is proved by repeating this argument.(and 
extending it to equations with several parameters u,,..., u,). 


COROLLARY. If f(x, ¥) has continuous derivatives with respect to 
x and y up to order p> 1, then the function y(x, Xg, Vy) Satisfying the 
differential equation 
=f») 


and equal to vy for X = X, has continuous derivatives with respect {0-Xq 
and Vo up to order p > 1. This assertion remains true for p = 0 if fis such 
as (0 guarantee uniqueness of the solution passing through (Xo, 0): 


Proof. Apply the theorem io equation (3.40). Also recall the 
theorem of Sec. 19 and the remark on p. 58. 


Remark 1. Given a solution (x, u9) of (3.44) corresponding to some 
parameter value u = uy, the derivative G2/du for u = u, and for values of 
x and } satisfying the equation y = ¢(x, u,) can be found directly (by 
quadratures) Without finding the other solutions of (3.44). In fact, under 


these conditions, @o/du is the solution of the linear equation (3.49) with 
known coefficients. 


Remark 2. By using the method of Sec. 19, we can establish the con- 
tinuous dependence of the solutions of (3.43) on the parameters 4, .. ++ Ys 
(or the differentiability of the solution with respect to these p yarameters) On 


any larger interval [, 3] containing (a, 6] such that the solution on [2, J 
lies in G., 
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Problem 1, Given the equation y’ = sin(xy) and the initial condition 
Xq = 0, Jo == 0, use (3.49) to find @y/éxy and dp/ ey (for arbitrary x). 

Problem 2. Given the equation y’ = x* -+ y* and the initial condition 
Xy =O, Yo = 0, find the value of dy/dyy at x = | to within an error of 0.0001. 

Hint, One way of finding y(x) is to use the method of successive approxi- 
mations. 

Problem 3. Prove that if the right-hand side of y’ = f(x, y) is continuously 
differentiable, then 


Oy(x, oF Vo) _ 
0X4 


—f (or yo) exp] |” fle, xOL adel, 


dy(x, Xo» yo) = 


aunt (™ 
i, exp{ [" fle. yl dt]. 


Problem 4, Let the functions Wf and N appearing in equation (2.21), Sec. 8 
be continuously differentiable, and suppose A7*(xX 9, Yo) + N“(x%9, ¥p) = 0, where 
(Xo, Vo) is a point of the domain G. Prove that (2.21) has a continuous integrating 
factor in some neighborhood of (Xp, Yo). 


Problem 3, Suppose that 


a) f(x, y) is continuous and satisfies a Lipschitz condition in y fora < x < 8, 


—-m cypc mw: 
b) f(x, y) > Ofory < F(x)and f(x, y) < Ofory > F(x), where F(x) is some 
function continuous on [a, 8). 
Then if 2 > 0, the soletion y(x, 2) of the equation 
dy 
p— = f(x, y) 
dx 
satisfying the initial condition y(vo) = 9, where a <xXq <b and yo > F(X), 
depends continuously on (x, 2) and is defined on [a, 6]. Prove that 


lim yxy, zu) = +0 (aX Xp) 
oy 
lim yx, 4) = F(x) (xp <x < OD), 


Ho O-b 


Where the second formula holds uniformly in x on every interval [c, 6] Go < 


¢ <b). 


Comment. Thus the way the solution of a first-order differential equation 
depends on changes of a small parameter multiplying »’ is essentially different 
from the way it depends on changes of a small parameter appearing in the right- 
hand side of the equation. The above result is a special case of a much more 
general theorem due to Tikhonov.” 


“ALN. Tikhonov, On the dependence of solutions of differential equations on a smalt 
Parameter (in Russian), Mat. Sb., 22, 193 (1948); On systems of differential equations con- 
fainting parameters (in Russian), ibid., 27, 147 (1950). 


66 GENERAL THEORY CHAP, 3 


22. Singular Points 


Consider the differential equation 


dy 
a= f(xy) (3.50 
dx Got) 
or 
dx eo I _ 
Pee | Olt? caaerrenane (3.50’) 
dy I(x. ¥) | 
(see Sec. 2) defined on a domain and possibly on some or all of the boundary 


e Sec 
of G. Then a point P in G is said to be an ordinary point of equation (3.50), 
(3.50') if there is a neighborhood «4° of P such that® 


a) One and only one integral curve of (3.50), (3.50°) passes through 
each point of 1; 
b) At least one of the functions f(x, 9), (, }”) is continuous on 4”, 


Otherwise P is said to be a singular point of equation (3.50), (3.50'). 

A sufficient condition for P to be an ordinary point is that f(x, y) be 
continuous in x and satisfy a Lipschitz condition in y on .4”, or that /i(, y) 
be continuous in y and satisfy a Lipschitz condition in x on 4°. However 
this condition is not necessary, as shown by the equation 


, {vin iy for == 0, 
y= ; 
: | 0 for y= 0. 


In this case, every point of the plane is an ordinary point, although the 
Lipschitz condition breaks down on the x-axis. 

The point P is singular in each of the following cases: 

1. P belongs to the boundary of G. For example, the origin is a singular 


point of equations (1.4) and (1.7), since in both cases G is the whole 
plane except for the origin. 


hm 


. P is a “point of nonuniqueness” (i.e., two or more integral curves 
may pass through P in every neighborhood of P), or P is a limit point 
of points of nonuniqueness. 


Lae 


. The given direction field has a discontinuity at ?.*° 


34 ’ 4 ” * Key A 

: By a neighborhood." of a point P we shall always mean a “complete neighborhood 
of P, cz. @ sufficiently small disk with P as its center, and not just the intersection of 
such @ disk with the domain G. Neighborhoods of other shapes will be allowed (cf. pp. 76 
170). | 

AS ae Pare Pigs Pee: - . . Sea ing tit 

Instead of a Lipschitz condition, we might require that the corresponding partial 
derivative be bounded on 4”, | 

*“ This case is seldom encountered in practice (however, see Sec. 23). 
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Remark. Naturally, combinations of these cases (e.g., a boundary point 
of nonuniqueness) are possible. 


By an isolated singular point we mean a singular point which has a 
neighborhood containing no other singular points. These are most frequently 
encountered in studying equations of the form 


dy — M(x.y) 


= (3.51 
dx N(x, y) 
or 
N. INUS SY ot 
dx N(x. y) Gar) 


dy M(x, ¥) 
where Af and 4 have continuous partial derivatives with respect to x and y 
of sufficiently high orders. Clearly, any point at which Af and N do not vanish 
simultaneously is an ordinary point of equation (3.51), (3.51’). If on the other 
hand, (xy, Yo) is a point where Af and N both vanish, we assume for simplicity 
that Xy = Vy = 0, and then expand Af and N in powers of x and ¥ up to 
terms of order two, obtaining 
dy M0, 0)x -- M,(0. O)y -+ O(x* + 5°) 
dx N,(0,0)x + N,(0, O)y + O(x? -+ ¥°) 
in a neighborhood of the origin.*? This equation does not define dp/dx at the 
origin, Moreover, dy/dx does not approach any limit as (x, y) —- (0, 0) if 
M (0,0)  Af4,(0, 0) | es 
N(0, 0) N,(0, 0) | 
(see below), and hence no definition of 37(0, 0)/N(O, 0) makes Af(x, ¥)/ N(x, ¥) 
continuous at the origin, 1e., the origin is an (isolated) singular point of 
equation (3.51). (3.51'). 

Perron has shown*’ that the terms O(x* +- °) appearing in the numerator 
and denominator of (3.52) have no effect at all on the behavior of the integral 
curves near the isolated singular point (here the origin), provided that the 
real parts of both roots of the equation 

}A— N(0,0) —N,(0,0) | _ 0 
| —A,(0,0) % — S,(0, 0) 


are nonzero (cf. Sec. 56). Therefore to get an idea of how the integral curves 


(32) 


0 


“ By O(x* ++ y%) is meant a quantity whose ratio to x? + y* remains bounded [as 
(x, ¥) -» (0, 0)). 

*O. Perron, Uber die Gestalt der Integralkurcven einer Differentialgleichunyg ersten 
Ordaung in der Umgebunyg eines singuliiren Punktes, Math. Z.. US, 121 (1922); Part 2, ibid., 
16, 273 (1923). See also [. Bendixson, Sur les courbes deéfinies par des equations differenticlles, 
Acta Math., 24, 1 (1901) and M. Frommer, Cher das duftreten von Wirbeln und Strudeln 
(geschlossener und spiraliger Integralkurven) in der Umeebung rationaler Unbestimunithetts- 
stellen, Math. Ann., 109, 395 (193-4), 
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behave near the origin, we need only study the integral curves of the equation 


dy ax + by 


ee 3.53 
dx ex-+dy (3.93) 


where 


c hUd 
It ean be shown (see Sec. 48 below) that a nonsingular linear transformation 
X= hyS + kyo, 


¥ = kad + Kagt 


a 


with real &,, reduces (3.53) to one of the following three forms: 


dt; 7, 
—! = k= (ke); (3.54) 
dz - 


aS S 


We now consider each of these cases in detail, noting first that although the 
>and y-axes intersect at right angies, the same is not true in general of the 


and y-axes. However, to keep things simple, the & and y-axes will be 
represented by perpendicular lines in the figures. 


ad 


Five 


Case 1. A complete integral of (3.54) is 
and the corresponding behavior of the integral curves is shown schematically 


in Figures 12, 13 and |. Fi eure.2 pertains to the case where k > 1. Here, 
except for the two halves of the %-axis, all the integral curves are tangent to 


qf 


Ficure }2 FiGurE 13 
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the Z-axis at the origin. Phe ¢ and y-axes are integral curves, except of course 
at the origin ttsell’. ae equation (3.54) defines no direction at all. The case 
e =: | has already been treated in Chap. | (sce Figure 1, p. 7). FO <k < 
all the integral curves except the two halves of “the Z-axis are tangent to ae 
q-axis, as shown in Figure 13. 

Examining the figures. we see that if &k <> 0, every integral curve ap- 
proaches the origin O along a definite direction (i.¢., has a definite tangent at 
O). In general, a point ? ts called a node if every integral curve with points 
sufliciently close to P approaches ? along a definite direction. Thus, if k > 0, 
ihe point O is a node for the integral curves of equation (3.54). 


Ficgure 14 Figure 15 


The integral curves 
q|s[" = 


corresponding to the case & << 0 are shown in Figure 14. Now only four 
integral curves approach the point O arbitrarily closely, i.e.. the two halves of 
the ¢-axis and the two halves of the ‘q-axis. Every other integral curve comes 
to within a certain distance of O and then begins to recede from QO. Singular 
points of this kind are called saddle points (the integral curves rese mble the 
level lines on a map corresponding to a mountain pass). 


Case 2. A complete integral of (3.55) is 
br, = 2(a + 6 [In s)), 


and all the integral curves are tangent to the y-axis at the origin O (see 
Figure 15). This time the q-axis but not the ¢-anis is an integral curve, and the 
point O is again a node, as in Case [ for & > 0. 


Cuse 3, Equation (3.56) is most easily integrated by going over to polar 
coordinates, Thus, writing 


6= 6c0s 9, y= 9 SIN 9, 
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we find that (3.56) is equivalent 


snes en ko, 


and hence 
9 == Ger: 

If k > 0. all the integral curves approach O winding around O infinitely 
often as go—> —%, while if k <0, the same thing happens as 9 > ep 
(see Figure 16). In either case, O is called a focus, 
i & = 0, the family of integral curves of (3.56) con- 
7 sists of concentric circles with their centers at O, In 
general, a point P is called a center if some neigh- 
| borhood of P is completely filled by closed integral 
>: curves, each containing P in its interior. Thus, if 


Sage 


g k == 0, the point O is a center for the integral curves 

of equation (3.56). However, a center may well turn 

| into a focus when the higher-order terms appearing in 
Figure 16 the numerator and denominator of (3.52) are taken 


into account, i.e., in this case the behavior of the 
integral curves of (3.51), (3.51') is not determined by the first-order terms. 
Later on, in Sec. 48. we shail see that & vanishes if and only if the real parts 
of both roots of the equation 
zA—c —d 
vanish. This cannot happen in any of the other cases considered above. 
The classification of singular points given in this section is due to Poincaré. 


Problem 1, Examine the behavior of the integral curves of the following 
equatuions near the appropriate singular points: 


ae ; ss, tke 
a) yo =a; bo w=: Cy as 
: = : 2 ; ye 
\- 
dj wat: Co aa (for integral a and 2). 
Fa : : | 


(there are four distinct cases). 


ee) GENERAL THEORY 7] 


Problem 3. How do the integral curves of equation (3.53) behave if 
a ob 
=] 
Cd 
Problem 4, Give an example of a singular point P with the property that a 
unique integral curve passes through cach point of some neighborhood of P. 


Problem 5, For what values of a, 6, cand d does equation (3.53) go into itself 
a) under all homogeneous affine transformations; b) under all rotations? 


23. Singular Curves 


DEFINITION. If every point of a curve is a singular point [of equation 
(3.50), (3.50’)], the curve is said to be a singular curve. If a singular curve 
is also an integral curve, it is Said to be a singular integral curve. 


Example 1, Every curve which ts not an integral curve of the equation 
= f(x. ¥), 
where f(x, )') is the function constructed by Lavrentev and alluded to in 
Sec. 9, is a singular curve (but obviously not a singular integral curve). 
Example 2, The x-axis is a singular integral curve of the equation 
yln?|y| for y #0, 


0 for ¥ nore 0, 

in fact an “integral curve of ala ee In practice, it is usually 
important to find such curves. since they help to form a picture of the 
behavior of the integral curves “in the large’ (see below). 


Example 3. Let G be the domain on which at least one of the functions 
f(x, ¥), ACY. ¥) appearing in (3.50), (3.50’) is defined. Then a curve is a singular 
curve of (3.50), (3.50’) if it is part of the boundary of G. Such a curve may 
also be an integral curve, — ¥) or f,(x, ¥) ts defined on G. Then we have 
a “boundary integral curve 

Example 4, The x-axis is an integral curve of the equation 

| for y > 0, 
; sone, QO for 4eSO: 
— for vO, 


since in this case the direction field is discontinuous at every point of the 
curve, 
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24. Behavior of Integral Curves in the Large. Limit Cycles 


It is offen important to form a qualitative picture of the behavior of the 
integral curves of a di ferential equation “in the large,” 1e., of their behavior 
on the whole domain of definition of the direction Held, without worrying 

about the detailed geometry of the problem. For example, Figures 12- 16 
depict the behavior of integral curves near an isolated singular point. Let G 
be a simply connected domain on which the right-hand side of equation 
(3.50). (3 (3.50' ) is defined, and suppose every point of G ts an ordinary point, 
Then the family of integral curves can be represented schematically by a 
family of parallel line segments. since in this case an integral curve passes 
through every point of G ‘and no two integral curves intersect, 

The structure of the integral curves can be much more complicated if 
equation (3.50), (3.50°) has singular points or singular curves. One of the 
fundamental problems of the theory of differential equations is to find out 
(by the simplest possible means) how the family of integral curves of a given 
differential equation behaves on the whole domain G, i.e.. to study the 
behavior of the integral curves “in the large." A definitive solution of this 
problem is still not available even for equations of the form 


al dy M(x, ¥) 
he N(x, ¥) 


where A/(x, y) and N(x, 1) are polynomials of degree higher than two. 
Having made these general remarks, we now turn our attention to an 


See! 


important concept of the qualitative theory of differential equations, Le., 
that of the limit cycle. Consider the differential equation 


—— o— |, (3.57) 


where 2 and 9 are polar coordinates in the xy-plane.?? The complete integral 


* The adjectives “global” and “nonlocal” are synonyms for the phrase “‘in the large.” 
The branch of differential eqvenons wh ich deals with elodal behavior of integral curves is 
called the “qualitative” or “geometric” theory of differential equations, See eg. V. V, 
Nemytski and V. V. Stepanov, Qualitative Theory of Differential Equations, Princeton 
University Press, Princeton, NJ. (1960), and S. Lefschetz, Differential Equations: Geometric 
Theory, second edition, Interscience Publishers. New York (1962). 

In rectangular coordinates, (3.57) takes the form 


dy (xr yn P48 = 


He! Ae ace Poa 
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of (3.57) 1s 


{ * ay ¥ 
Sy) [ wcbee C 2¥ 
ad i c'," 


where C is an arbitrary constant and cannot take values larger than 
—In|C|if C < 0 (otherwise 9 becomes negative). 
The corresponding family of integral curves (see 
Figure 17) consists of 


1) The circle 0 = 1 (C = Q); 


2) Spirals emanating from the origin O and 
approaching the circle 9 = | from the in- 
side as 9 — —o (C < QO); 


3) Unbounded spirals approaching the circle 
o = 1 from the outside as o — — oo(C >- 0). 


A closed integral curve is said to be a /imit cycle Figure 17 

(of a given differential equation) if it consists 

entirely of ordinary points and if some other integral curve approaches it 
asymptotically.7! Thus the circle 9 = | is a limit cycle of equation (3.57). 
The search for limit cycles is a problem of great physical interest. 


Remark. The fact that every point of the circle 9 = 1 is an ordinary 
point of equation (3.57) can be verified by transforming from polar co- 
ordinates to rectangular coordinates. Thus a small neighborhood of any 
point of the limit cycle is in no way different from a small neighborhood of 
any other nonsingular point. 


Problem I. Show that a necessary and sufficient condition for a direction 
field defined on a domain G to be representable by equations of the form (3.51), 
(3.51’), where fs and N are continuous and do not vanish simultaneously, is 
that it be possible to equip every point of G with a unit vector which coincides 
with the direction of the field and depends continuously on position in the 
ficld. As before, the direction of the field at each point is specified by a line 
segment, but now (cf. footnote 2, p. 5) one of the directions of the segment 
must be chosen at each point and this direction must be a continuous function 
of position. 


Problent 2. Give an example of a direction field on a (two-dimensional) 
annulus which cannot be represented on the whole annulus by an equation of 
the form (2.21), where Afand Nare continuous and do not vanish simultaneously, 
but which nevertheless varies continuously on the whole annulus. As usual, we 


* Another definition of a limit cycle encountered in the literature is the following: 
A closed integral curve L is said to be a limit cycle if it consists entirely of ordinary points 
and if there are no other closed integral curves in some strip containing L. 
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represent the direction field at every point by a line segment, without dis- 
tinguishing between the two directions of the segment, and in saving that the 
direction field varies continuously on the annulus we mean that this segment 
varies continuously. Can an analogous example be constructed for a simply. 
connected domain in the plane? | 


FiGureE 18 


Problem 3. Suppose the integral curves of equation (3.51), (3.51’) have the 
form shown in Figure [8. Prove that if Af and N are continuous, then they must 
vanish everywhere on the segment 2B, 


Problem 4, Prove that the number of integral curves of equation (3.51), 
(3.51) which can enter an isolated singular point must either be even or infinite. 
Prove that if any integral curve approaches an isolated singular point along a 
spiral winding around the point an infinite number of times, then every other 
integral curve must approach the point along such a spiral. Can precisely two 
such spirals approach an isolated singular point? 


Problem 5. Sketch the behavior of the integral curves of the equation 


dy x ‘ , 
ee a eed 
dx V 


in the whole plane. Show that there is a focus at the origin and a limit cycle 
with equation x? + y° = 1. 

Hint, Compare the slope of the integral curve of this equation with the 
slope of the integral curves of the equation 


at the same points. 


Problem 6, Sketch the behavior of the integral curves of the equation 


dy p< ye ( " ( 12 2 ( i) l 
oe eee =: \' ea xX -_ — x Ena ta -- P— x ‘eam a. cn 
hr toe : ) 3 3 


in the whole plane. Show that there are two singular points, a saddle point at 
(0,0) and a focus at (1, 1). 
Hint. Compare the slope of the integral curves of this equation with the 
slope of the integral curves of the equation 
ae ene 


dx yox 


(which is easily solved by quadratures) at the same points. 
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Problem 7. Prove that if a closed integral curve L consisting entirely of 
ordinary points can be included in a strip containing no other closed integral 
curves, then £ is a limit cycle. 


Problem 8 Prove that if a closed integral curve L containing no ordinary 
points is approached asymptotically from the inside and from the outside by two 
integral curves, then L can be included in a strip entirely filled with integral 
curves approaching L asymptotically. 


Problem 9, Construct an example of a closed integral curve ZL without 
singular points which is not a limit cycle, where no neighborhood of L is 
entirely filled with closed integral curves. 


Problem 10. Prove that if'a direction field has no singular points inside or on 
a closed curve £ with a continuously varying tangent, then the direction of the 
ficld coincides at least twice at points of L with the direction of the tangent to L 
and at least twice with the direction of the normal to L. 


Comment. In particular, this implies a theorem of Bendixson which states 
that there must be at least one singular point of the field inside a closed integral 
curve, 


Problem 11, Prove that every integral curve of the equation 


dy 7 : 
— = an an o.. 


dx 
has at least one inflection point and intersects the line y = x at least once. 
Hint. Use Prob. 5, Sec. 2. 


25. Equations Not Solved for y’ 
We now consider first-order differential equations of the general form 
F(x, yy) = 9, 
rather than of the special form 


am f(x, A: 


Example. The equation 


yt—1=0 (3.58) 

is equivalent to the pair of equations 
bg == +], (3.59) 
y=, (3.59’) 


Every direction in the field corresponding to (3.59) makes an angle of 45° 
With respect to the x-axis. while every direction in the field corresponding 
to (3.59') makes an angle of 135° with respect to the x-axis. Therefore, given 
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any point (Xp, Vp) Of the xy-plane, there 
is one and only one integral curve of 
‘ Hae passing through (xo, y) [a line 
slope -+- 1] and one and only one in- 
yi ral curve of (3.59’) passing through 
(3.59’) [a line of slope —1]. The ae 
rection field of (3.58) is obtained by 
superposition of the direction fields of 
(3.59) and (3.59"), and as a result, there 
are two and only two integral curves 
of (3.38) passing through each point 
Ficure 19 (Xa. Ya) Of the xy-plane, as shown in 
Fieure 19 
This example is a special case of the following general 


THEOREM. Given a difjerential equation 
Fey) =: (3.60) 


suppose F(x, y, v") is such that 
1) F(x, 24") is defined and continuous on a closed bounded domain G 
in (N,V, ¥°) space: 
2) At some point (Xo. V9) of the xy-plane, equation (3. 60) has a finite 
number of distinct roots b,,...,0,, when solved for y’; 
3) Each of the points (Xo, 9, 0;), i= 1...., 4 belongs to G and has 
a neighborhood R, in which py yy) fas continuous partial 
derivatives F(x, y,y') and F(x, y,y'), where |F,(x, y.y)| > 
e > 0; 
Then there is a neighborhood A” of (Xo, Yq) such that precisely m solutions 
of (3.60) pass through each point of A’. 


Proof. Under these assumptions, it follows from the implicit 
function theorem that each of the points (Xy, Vo, ;) has a complete 
neighborhood R,; in (x, 1. y’) space (cf. footnote 34, p. 66) in which 
(3.60) has one and only one solution of the form 


=f(xy),  F=1,....m, (3.61) 


** By the intermediate value theorem for derivatives (see ¢.g.. T. M. Apostol, op. cit. 
p. 94), if (x) has a derivative ¢{x) everywhere in an interval (a, b) and if ¢(x) takes the 
values y, and y, at points x, and x, in (@, 6), then o(x) takes every value between y; and ys 
at some point of (x,, x.). Hence there is no function yx) whose derivative y(x) exists for all 
x and takes only the values +1, such that y'(x) = +1 for some values of x and y (x) = -! 
for other values of x. The fact that there are no smooth integral curves other than those 
mentioned is also obvious from F igure 19, 
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where /(x. 4) 1s continuous in x and has a derivative with respect to y 


equal to 7 Fy f) 

FG, Yh) 
Since [f, (x, ¥,f;)| = ¢ = 0, the quantities (3.62) are bounded. The 
neighborhoods Ry. ....,, can be represented as cylinders with 
generators parallel to the y-axis, where the 
projection of the base of each cylinder onto 
the xy-plane is the same neighborhood 4° 
of the point (Xp, Vo}, as shown in Figure 
20 for the case m= 2. This neighborhood 
“/ can be chosen so small that there is no 
point (vy, ¥, 4) of the surface (3.60) either 
above or below .#° which does not belong 
to one of the surfaces (3.61). In fact, 
if such points existed for arbitrarily small 
AS then, since F(x, y, ¥) is continuous 
and G is closed and bounded, they would 


Sabon (3.62) 


| 
i 
| 
' 


have to lie outside the cylinders Ay... .. K. 

on the line x = X9, y =}. But this is im- NM! Asa io) 
possible, since then (3.60) would have more as 
than 7 roots when solved for y" at the point Eicure 20 

(Xo, Yo) . 


Thus we find that the point (vy, ¥9) has a neighborhood (in the xy- 
plane) where equation (3.60) has precisely #2 solutions (3.61). Each 
function f(x, ¥), = 1,.... a1 is continuous in x and has a bounded 
derivative with respect to y. Therefore each of the equations (3.61) has 
one and only one integral curve passing through any given point of 1. 
Since the )" are all different in f°, these integral curves are all different 
and no two make contact without intersecting. Therefore, as asserted, 
precisely #1 integral curves of equation (3.60) pass through each point of 
vf (cf. footnote 42, p. 76). 


Obviously, none of the directions of the field defined by (3.60) is soe lel 
to the y-axis, and hence none of the integral curves of (3.60) can have 


tangents parallel to the y-axis. However, just as in the case of equations solved 
for y’, we would like te include the possibility of directions parallel to the 


yeaxis. Therefore, besides the equation 


. dy’ a8 
! (x SP} = 0. (3.63) 
aX: 
we shall sometimes consider the equation 
‘ dx. ” “af 
F{s } ) = Q, (3.63’) 
ai 
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woud 


where the function F, is chosen in such a way that the equations (3.63) and 
(3.63) are consistent. It is sometimes more convenient to combine. (3.63) 
and (3.63’) into a single equation written in terms of differentials (see Example 
1 below). Thus, just as in Sec. 2, in addition to solutions of equations of the 
form (3.60), we shall consider integral curves of the pair of equations (3.63) 
and (3.63’), or more concisely, of the equation (3.63), (3.63’). 

Now suppose the equation (3.63), (3.63’) is defined on a domain G,,,, 
(or G,,.-] in (x yy") space [or (x.y, x") space] and possibly on some or all of 
the boundary of the domain. Let G,, be the domain in the xy-plane where 
equation (3.63). (3.63’) defines a direction field. Then a point P in G,, is 
said to be an ordinary point of equation (3.63), (3.63’) if there is a neighborhood 
4° of P in the xy-plane such that the same finite number of integral curves, 
equal to the number of directions specified by (3.63), (3.63’) at P, passes through 
each point of .f. These curves are obtained by superposition of families of 
integral curves of equations of the form (3.50), (3.50°), where at least one of 
the functions f(x, 7). A(x. 3) 1s continuous on 1°, Otherwise, P is said to be 
a singular point of equation (3.63), (3:63 ). 

According to the theorem just proved, the following are sufficient con- 
ditions for P to be an ordinary point of (3.63), (3.63"): 


a) Given any set F in the xy-plane, let (£),,. [or (F),,..-] be the set of all 
points (xv v.3") for (x. s.x")] such that (x.y) belongs to E and 
F(x, y,") {or F(x, ¥. x")] is defined. Then the point P = (Xg, J'9) has a 
closed neighborhood ..1” such that G3,,- = (4 )eyy- [or GE = (A ae] 
is closed and bounded, and F(x. y. ¥") [or Fy(x, y, x”)] is continuous on 
Gr lorG. al. 


b) The number of integral curves specified by equation (3.63), (3.63/) at 
the point P is finite. 
c) For each of the directions specified by (3.63) [or (3.63’)], the function 
Fix. yy’) for Fy(x. yx") satisfies condition 3 of the theorem (on 
p. 76) at the point P.*° 
Thus, if F and F, are sufficiently smooth, the singular points of equation 
(3.63), (3.63’) must either be boundary points, or else satisfy one of the systems 


ee Ve y) = 0, Ff o(X, y y) = Q. (3.64) 
Pyx, Jy; oF on 0, Folks J; ae == Uy, (3.64') 


where it is assumed that conditions a and 6 are met. 

Finally. we define singular curves and singular integral curves just as in 
Sec. 23, except that we now start from the concept of a singular point of 
the general equation (3.63), (3.63'), rather than of a singular point of equation 


* In the case of F,(x, y, x’), the roles of x and y, and of y’ and x’, must be reversed in 
the statement of the theorem. 
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(3.50), (3.50') which is a special case of (3.63), (3.63). The examples of 
singular points, singular curves and singular integral curves considered in 
Secs. 22 and 23 still apply here. We now present two further examples, based 
on equations of the form (3.63), (3.63’). 


Example 1. Consider the equations 


yeh — x°) — x? = 0, (3.65) 
(1 ee") ore a0, (3.65’) 

which can be written im the more symmetrical form 
(lh = x*)\(dp)*® — x*(dx)* = 0. (3.66) 


Equation (3.66) defines a direction field only in the strip |x| < 1. The left- 
hand side of (3.65) is continuous and has continuous derivatives with respect 
to y and yp’ everywhere in this strip. [ts derivative with respect to y" is 
sf — x*) and vanishes only if x= +I or y =0. Because of (3.65), 

= 0 implies x = 0. The derivative with respect to x’ of the left-hand side 
of (3.65) vanishes on the same lines (and nowhere else), and hence (3.66) 
can have no singular curves other than the three lines 


y=—1l, x=0, x= +] 
The lines x == -+1 are singular curves, since they form the boundary of the 
domain where (3.66) defines a direction field. Moreover, it is clear from 
(3.65’) that the lines x = -+1 are integral curves. 


Next we show that line » = 0 Is also a singular 
curve (but not an integral curve). First we observe 
that (3.66) implies 

dy x 
ee 
dx vl—x* 
and hence all circles of een | with centers on the 
V-axis are integral curves of (3.66) [note that all these 
circles are tangent to the lines + = -+-1]. It is then easy 
to see that the line x = 0 is a singular curve. In fact, 
on this line equation (3.65) has only one solution 
3” = 0, while equation (3.65’) cannot be solved for x’ at 
all, But there is no point Bon the y-axis with a neigh- 
borhood .4° such that one and only one integral curve 
passes through each point of .4°, since there are four 
Integral curves passing through & itself, ne. the curves A, Bly, AgBAg, A,BA, 
and A,Ba4, shown in Figure 21. In other words, the y-axis is a singular 
curve, but obviously not an integral curve. On the other hand, if P is any 
point of the strip — 1 <x << | which does not lie on the y-axis, then two 


FIGURE 21 
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and only two integral curves pass through each point of some neighbor- 
hood of P. | | 

Finally we note that besides the integral curves already mentioned, 
equation (3.66) also has integral curves of the form 


P,B P,P, BaP oP), P,B,P,BP,OP,BP,, 


and so on. Thus aie many curves pass through any point of the lines 
x= +1, and he ce these lines might be called “boundary integral curves of 


ss 


NONUNIqUENESS 


Example 2, Consider the equation 
F(x, yy") = ¥ — xy — fO") = 9, (3.67) 


known as Clairaut’s equation. We shall assume that /()") 1s sega yen 
toge i with its first and second derivatives on a closed interval a < y' <b, 
and t rat f"(s") has constant sign (say nega tive) on this interval. Tien for 
ri x and y equation (3.67) has no more than two roots when solved for 
y’ (why?). Therefore it Is easy to see that (xg. 9) is an ordinary point of 


(36 7) 
1) The equation 
yo — MY —fO") = 0 (3.68) 
is satished for at pia one value of y" in the open interval (a, 6) but 
not for y =aory =): 


2) F(X. Vo ¥) = — Xo — f'(") = 0, where y’ is one of the roots of 


The points which do not satisfy condition 2 form a curve, which, in terms 
of the parameter p = y’, can be written as 


x=—f(p), y=axp+f(p) (3.69) 


x=—f(p), y= —f (pp + fp). (3.70) 


These equations define y as a function of x. To see this, we need only solve 
x == —f(p) for p [which is possible, since f"(p) te constant sign], and then 
substitute the result into the equation for y. Clearly, the solution of (3.70) 
is an integral curve. In fact, it follows from (3.70) that 


dx = —f"(p), 


dy = [—f"(p)p —f(p) + fp) dp = —pf(p) ap 


and hence 
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j.c., the solution of (3.70), or equivalently of (3.69), has the slope given by 
(3.67), Because of (3.71), if p increases, so does dy/dx. Moreover, (3.70) 
shows that 
dx 
mr 
dp 


f'(p) < 9. 


Therefore the curve (3.70) is concave up- 
ward, as shown in Figure 22, where the 
curve (3.70) is indicated by AQB. 

It is easily verified that the line 


0 
if 


yoex + f(e) (3:72) Figure 22 


is an integral curve for any constant ¢ (a < ¢ < 6). Clearly, this line is 
tangent to the curve (3.70) at the point 


x= —f'(e), y= —f(ele + fle). 


Moreover, every tangent to the curve (3.70) is of the form (3.72) [why ?]. 
Therefore equation (3.68) has as many roots (when solved for y’) as there are 
tangents to the arc AQB passing through the point (xy, 9). 

We now draw the tangents 1J/ACD and NBCE to the arc AQB at its end 
points A and B&. Together with AQ, these lines divide the whole xy-plane 
into five domains 


MACE, NBCD, AQBCA, ECD, MAQBN. 


There is one and only one integral curve tangent to the arc AQB passing 
through each point P inside the angles MA CE and NBCD, and hence 
equation (3.67) has one and only one root }” (distinct from @ and }) at each 
such point P, which is therefore an ordinary point. In other words, every 
point P in these domains has a neighborhood .4° such that the number of 
integral curves of (3.67) passing through each point of 4° is the same as the 
number of roots of (3.67) at P, i.e., exactly ove. This unique integral curve is 
a segment of the tangent line to AQB drawn through P. In the same way, we 
find that every point inside the domain AQBCA is also an ordinary point 
of (3.67). In fact, every point P in this domain has a neighborhood .1” such 
that two and only two integral curves pass through each point of 1, where 
the integral curves are again segments of tangents to 4Q8 drawn through P. 
On the other hand. every Interior point of the domains WAQBN and ECD 
has a neighborhood containing no integral curves of (3.67), and hence the 
points of these domains are classified neither as ordinary points nor as 
singular points, 
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Of all the points in the xy-plane for which equation (3.67) has roots when 
solved for y", only the points of the lines 74CD and NBCE fail to satisfy 
condition 1 (on p. 80), and only the points of the curve AQB fail to satisfy 
condition 2. It is easy to see that these curves are singular curves of equation 
(3.67). 

Finally we note that besides the integral curves mentioned above, equation 
(3.67) has further integral curves like the curve SOBN, consisting of parts of 
tangents to 4Q8B and part of the are 4 OB itself. 


Problem I, Determine the form of the integral curves of the following 
equauions: 

a) sins’ = 0; b) siny’ = Xx. 

Problem 2. Consider the equation F(x, y’) = 0. Prove that if the curve 
F(x, 2) = 0 has a vertical tangent at x = ¥9, 2 = 29, Which does not intersect 
the curve, then 

1) There is an integral curve of F(x, ) = 0 passing through each point 

(Xo, ¥p) : 

2) The integral curve has a cusp at (X9, )). 

Make a similar study of the points where the curve F(x, z) = 0 has a) maxima; 
b) cusps: c) self-intersections; d) vertical asymptotes. Analyze the following 
equations 


ea yy Sa ye); b) x(l + yp) = yt 


Problem 3. Solve the Clairaut equation 
Dee 9 tia Yea, 
Why does the family of integral curves have a different appearance from that 
of Figure 22? 


Problem 4. Investigate the family of solutions of equation (3.67) if f°’), 


too 


a < y’ <b vanishes a finite number of umes. 


Problem 5. One method of solving the system of equations 


dy | 
ae : Pareion Pana €) (3.73) 


Where / is a parameter, is to differentiate the first equation, which leads to 
rit) 37, 
= ft) i om Ge (3.74) 
' o(7) 


Then (3.74) together with y = f(r) gives a parametric representation of the 
family of solutions of (3.73). Give sufficient conditions for the validity of this 
method. 
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26. Envelopes 


Suppose the differential equation F(x, y, ») = 0 has a family of integral 
curves 


P(x, y, C) = 0 (3.75) 


covering a closed bounded domain G in the xy-plane in the sense that at 
least one curve of the family (3.75) passes through each point of G, but only 
finitely many curves pass through any point of G. Consider the problem of 
finding a curve ZL lying in G such that 


1) L is tangent to some curve of the family (3.75) at each of its points; 


2) Infinitely many curves of the family (3.75) are tangent to each arc of 
| 


Such a curve L is called an envelope of the family (3.75). Obviously, the 
envelope ZL of a family of integral curves ts itself an integral curve since at 
each of its points, L is tangent to an integral curve and hence has one of the 
field directions. We shall assume that the function P(x, y, C) has continuous 
derivatives in all its arguments. Further assumptions, indicated in italics, 
will be made later. 

Suppose an envelope L exists. Then ZL is tangent at each of its points 
(x, y) to a curve L,-, where the index indicates the value of the parameter C 
for which the curve L,, is obtained from the general equation (3.75). Therefore 
the coordinates of the points of L sausfy the equation 


(D[x, Ms CO ¥)] =O; (3.76) 


where now C is no longer a constant, but varies along the curve L, taking the 
value of C corresponding to the variable Z,.. We shail henceforth confine 
ourselves to arcs of L on which y is a@ differentiable function of x. Then 
C(x, ¥) in equation (3.76) can be regarded as a function of x only, so that 
(3.76) becomes 

Dix 93, Cx) = 0. (3.77) 
The function C(x) is regarded as known, and moreover it is assumed that 


C(x) is differentiable and has no intervals of constancy (i.e., no intervals on 
which it reduces to a constant). 


** Curves of the family (3.75) are considered to be distinct if they correspond to distinct 
values of C. Regarded as geometric objects, two curves of the family (3.75) may coincide on 
some arc, However, because of our assumptions, it is impossible for all the curves of (3.75) 
tangent to an are of an envelope of (3,75) to coincide on the are. 

* Arcs on which x is a differentiable function of y can be considered in just the same 
way, 
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We now find the value of y” for the function (x) satisfying (3.76), Thi 
can be done in two ways. First we can differentiate (3.77) with respect bx 
obtaining 

eo ad , , a 

dx oy 8c 
On the other hand, the value of »” corresponding to the curve Lp passing 
through the same point (x, ) is found by differentiating (3.75) with respect 
1O:x: 


C= 0. (3.78) 


Boe aay Us (3.39) 

ax ay 
Therefore in order for the values of y’ given by (3.78) and (3.79) to coincide 
(y’ can be determined from these equations provided that @®/dy + 0), ie. in 
order for the curves (3.75) and (3.76) to have a common tangent at the point 
(x, ¥), it Is necessary that 


<c'=0. (3.80 


It follows from (3.80) that at least one of the factors @@/dC and C’ vanishes. 
But C is assumed to have no intervals of constancy, and hence C’ cannot 
vanish on any interval. Therefore ihe envelope satisfies the equation 


— = 0, (3.81) 


together with (3.77). 

It is easily verified that conversely, if the equations (3.77) and (3.81) 
[where D(x. ¥, C) has the properties assumed above] determine a pair of 
differentiable functions }{x) and C(x), such that C(x) has no intervals of 
constancy, then y = y(x) is the equation of an envelope of the family (3.75). 


a 


Remark I, Since x and y play identical roles in the statement of the 
problem, their roles can be reversed in its solution. 


Remark 2. An envelope L of a family (3.75) of integral curves. of a 
differenuial equation F(x, y, p’) = 0 is always a singular integral curve of 
the equation, In fact, L is an integra} curve, as already noted, and moreover 
all the points of L are singular. since infinitely many integral curves pass 
1 ; * . eos s « ” why) 
tnrough any point P of L in an arbitrarily small neighborhood of P (why?) 

Example 1. The family of curves 

a , eg: 29) 
M(x, y, C) == ys — (x + Ci) = 0, (3.82) 


defined on the whole xy-plane, consists of the cubical parabolas. obtained 
by subjecting one such paradola, say y = x4, to shifts parallel to the 1-akh, 
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Setting dP/AC equal to zero, we find that —3(x + C)* = 0 and hence C = 
—x. Substituting this value of C into (3.82), we obtain the line y = 0, which 
is obviously an envelope of the family (3.82), as shown in Figure 23. Note 
that if we had written the equation of 

the family in the form yi 


D(x, y, C) = y8 — (x + C) =0, 


then @/8C would equal —! and 
our method would give no envelope, 
although one actually exists. This is 
due to the fact that now d/dy does 
not exist for py = 0. 


FIGURE 23 


Example 2, Consider the family of curves 
P(x, y, C) = yy? — (x + C= 0, (3.83) 


defined on the whole xy-plane. Setting d@/dC equal to zero, we find that 
—3(x + C)? = 0 and hence C = —x. 
y Substituting this value of C into (3.83), 
we obtain y= 0. But the x-axis is 
obviously not an envelope of the family 
(see Figure 24). This discrepancy is due 


0 4 to the fact that 
ap. , 
— = 5)" =0 
oy 
FIGURE 24 
for y = 0. 
Example 3, The family of circles 
Pix, y, C) = xw+O + CP -— 1 =0 (3.84) 
covers the strip between the lines x = +1. Setting dD/dC equal to zero, we 
find that 2(y-+ C) = 0 and hence C = —}. Substituting this value of C 
into (3.84), we obtain x = +1. Each of these lines is an envelope of the 
family (3.84) [recall Figure 21]. 
Example 4. The equation 
ym Coxe + 2C*x — C = 0, (3.85) 


Which can be written in the form 
; a l : 
, aes c XxX —_ akan ¢) 
C 


if C 0, defines a family of parabolas with axes parallel to the y-axis and 


86 GENERAL THEORY CHAP. 3 


vertices on the x-axis. The x-axis is obviously an envelope of this family. 
Moreover, the x-axis is itself a curve of the family, as can be seen by setting 
C = 0 in (3.85). | 


Problem. Use the general method of this section to find the envelopes in 
Example 4. What is the geometric meaning of the resulting hyperbola? 


Part & 


SYSTEMS OF 
ORDINARY DIFFERENTIAL EQUATIONS 


A 


GENERAL THEORY ‘OF SYSTEMS 


27. Reduction of an Arbitrary System to a System of 
First-Order Equations 


Consider the system 


ils a dy amt yy ee ; 
lie ee ere er Wnt a ee =e Or | 
dx ix” dx (4.1) 


fe Pee ae 


Where each equation contains the independent variable x. 7 unknown functions 
Vye.e +5), OF x, and derivatives of each y, with respect to ¥ up to some 
order #;. To reduce the system (4.1) to a first-order system, we write vy, =) 
and 


(i) 
dy; . 
nt — _ aad Oe PO | 7 Pee (4.2) 
ax 
Then (4.1) takes the form 
fy Gs) jy lita LD) 
(Oh tL) ay dy (oO) | {1) (imal) GV n 
«| vy Vi V1 . * * ce * a a a Va ’ vx eee ny : _ * — i= QO, 
: dx adx : 


PS 1s ci kg he. A? 


Thus, given # functions y,(x), i= 1....,m satisfying the system (4.1), we 
obtain a set of functions (x) which satisfy the system of first-order 
differential equations consisting of (4.2) and (4.3). Conversely, given a set 
of functions y@!(y) satisfying (4.2) and (4.3), itis easy to see that the functions 
wy) d= 1,...,a satisfy the system (4.1). In fact, consecutively setting 


89 
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k=0,1,...,m; — 2 in (4.2), we find that 


_ eve +1 ae 


+ 
me 


dX 
and then (4.3) implies (4.1). 

Henceforth we shall be concerned mainly with systems of first-order 

differential equations which are solved with respect to the derivatives. 
Problem. Prove that a system of first-order differential equations of the form 

fix, Visee- re ee se eh) = 0, js L aw yak 

can “in gencral” be reduced by consecutive differentiation and elimination of 
superfiuous variables to a single diferential equation of order & involving one 
unknown fenction, whose solution allows the other unknown functions to be 
determined without further integrations. The phrase “in general’ means that 
we assume the possibility of solving all relevant systems of finite (nondifferential) 
equations in which the number of equations equals the number of unknowns, 


It is assumed that the functions /; have continuous derivatives of ail required 
orders. 


28. Geometric Interpretation. Definitions 


Consider the system of differenuial equations 


iy: 
ms Sie. Patuca (4.4) 


Where the functions f(x, ¥,,...,.¥,) are defined on some domain G in 
(X,¥1,---,J',) space.) A set of functions 


bi Go eememan oae es (4.5) 
satisfying the system (4.4) will be called a solution of the system. The equations 
Y=), f= 1,...,0 (4.6) 


define a curve in (x, }4,..., 4.) space called an integral curve (in fact, a 
“ graph”) of the system (4.4). Instead of saying that y,(Xy) = ye. = 1. 
we shall often say that the curve (4.6) or the solution (4.5) “passes through” 
the point (x, y?,... , ¥°). The set of functions 


De VAN Cp cay, are beset). send (4.7) 


Involving parameters C,,..., C,, is called the general solution of the system 
(4.4) in a domain G if every solution of (4.4) lying in G is given by (4.7) fora 
suitable choice of C,,..., C,,. The most common case Is “where nt == 1. 


'Le., the space of POMS CO ones 9): 
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The fact that the functions y,(x) satisfy the system (4.4) means geo- 
metrically that the tangent Hine / to the integral curve (4.6) at the point 
(x, ¥i(X), ..- ¥a(%X)) Is given by 

Y, — yx) 
X —-xXx 


where X and Y, denote running coordinates along /. Therefore the problem 
of solving the system (4.4) has the following geometric interpretation, which 
is the n-dimensional analogue of that given in Sec. 2: Suppose that through 
every point (x, 1,.-., ¥,,) of a domain G we draw a short line segment" 
whose direction cosines are proportional to the quantities 


Py ser eer On eee es iy Rarer 2 


thereby specifying a set of directions in G, called the direction field of (4.4). 
Then finding an integral curve of the system (4.4) means finding a curve whose 
tangents have directions belonging to the given direction field. 

If the direction field is specified in this way, none of the integral curves 
can lie in a plane parallel to the plane x = 0. The artificial character of this 
restriction is revealed by the following considerations: Starting from a 
given direction field in (x, ¥;,..-..3',) space, we look for smooth curves® 
(called integral curves of the field) whose tangents have directions belonging 
to the field. In general, the equations of these curves cannot be written in the 
form (4.6), solved with respect to the unknowns y,,...,¥,, since planes 
parallel to the plane x = 0 may intersect the curves several times or even 
contain entire arcs of the curves. However, differential equations of the curves 
(or equivalently, of the corresponding direction field) can be written down 
by regarding x and the v, as (continuously differentiable) functions of a 
suitable parameter /, ¢.g., the arc length along the integral curve or the time 
required to traverse the integral curve from some fixed point on the curve to 
a variable point (xX, ¥,,...,.¥n). Then we have 


= filx, Cx), ~~ 6s FAC), bes deh tone Wy 


— Ge Vis eens Mab i= I, caaandhy 
(4.8) 


where the functions /* and f* cannot all vanish simultaneously. The direction 
field represented by (4.8) can also be written in the form 


' iv dx PF 
J 1 (X, Mises vad fils, Misc 1) J (x, Le Ce ¥,) 


* Again we make no distinction between the two directions of the segment. 
* Cf. Remark 1, p. 6. 
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On separate arcs of the curves, it may be possible i choose one of the co- 
ordinates x. Vy, ....¥, a8 the parameter. For example if f(x, ¥4,...,¥,) is 
ali in some eee G, we can solve (4.9) for dx|dy, and dy,/dy, 
(i=l... k-WA+1,....0). Le, we can use y, for the parameter, 
Similarly. it f(r ee a is nonzero in G, we can solve (4.9) for the 
dy Jdx, and then x plays the role of the parameter. 

The system (4.8) is of the form (4.4), except that the number of unknowns 
is one larger. Hence there is no loss of generality in confining ourselves to 
systems 0 - form (4.4). Such systems obey theorems analogous to those 
proved 1 in Chapter 3, and in most cases the proofs are so similar that there 
is no need to repeat them in detail. Thus we shall merely state results, except 
in the case of Osgood’s theorem and the principle of contraction mappings, 


oe 
ree 
of 


Remark, A system of equations 


determining an integral curve of the system (4.4) is called an integral of (4.4). 
Moreover, a system of equations 


D Rite Mae Cinea ac a) oO, 2 eee (4,10) 


involving parameters C,,..., C,, is called a complete integral of the system 
(4.4) in a domain G if every integral curve of (4.9) lying in G is given by 
(4.10) for a suitable choice of C\,...,C,,. 


Problem 1, Construct a system of two first-order differential equations of the 
form (4.4), with unknown functions y and z, whose integral curves are all 
right-handed helices with a given pitch # and the x-axis as common axis. How 
can this problem be generalized to a larger number of dimensions? 


Problem 2, Give necessary and sufficient conditions for a direction field ina 
domain G to have a representation of the form (4.9), where the denominators 
are all continuous and do not vanish simultaneously. In what domains can any 
continuous direction field be represented in this form? 


29. Basic Theorems 


THEOREM | (Peano’s existence theorem). If the functions fAX, V1. ++++¥n) 
are continuous on a domain G in (x, Vy ee a Hy) Space, then at least 
one integral curve of the system: (4.4) passes through each interior point 


(ty, vi eee a | .¥,) of G. 


Fre pee 5 
Proof. be as In Sec. 11, we construct Euler lines L, passing through 


- af) 
(Xo Vs -... 37) and then pass to the limit |Z,|-—~0, using Arzela’s 
theorems 


emesis 


‘For the meaning of |L,1, see p. 37. Also recall Remark 3, p. 32. 
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THEOREM 2 (Osgood’s uniqueness theorem). Suppose the functions 
LAX Vis es Vy) Satisfy the conditions 


n 
a ‘a ne * * “ae ae E 
[fils Pry ee es Pa) Li Ve VL af 219, — vil), 


vio] 


(4.11) 
eee 
for every pair of points (x, Jy,...,3,), (4, yR,.... *) in a domain G, 
where o(u) > 0 is a continuous function on 0 <u < a and 

‘a du 


lim ee) 
ear *? (u) 


Then there is no more than one integral curve of the system (4.4) passing 
ee ee 0 
through each point (Xo, Vy, ~-- +92) of G. 


Proof. The most common choice ts o(#) = Ku. In this case, (4.11) 
becomes 


n 
i Iu BDAC yh wD KI ~ vib 
inc] 


° (4.12) 
P=1,...,M, 
and is known as a Lipschitz condition Since the proof of Osgood’s 
theorem is somewhat more complicated for a system of differential 
equations than for a single equation, we now supply the details. 
Suppose there are two distinct solutions ¥,(x).....3,0%) and 
yy (x), . . . * (x) of the system (4.4) such that 


t 
TiS ye (Xp). (2 2 eee |e (4.13) 


Since the solutions are distinct, there is a number x, such that 
i 7 
> LyX) ~— yp (xp] > 0. 
i=l 


There is no loss of generality in assuming that xX, > Np, since otherwise 
we need only replace x by —x. Despite the fact that the functions ¥,(4) 
and y"*(x), and hence the differences y,(v) and ¥¥(x), have derivatives, 
the absolute values of the difference y,(v) — yi(x) will fail to have 
derivatives at all points where 
ne de al : ree ae 
vids) — 7) = 0, SLs) — FO) = 0. 
dx 

This compels us to consider right-hand (or left-hand) derivatives at 
these points, instead of ordinary derivatives. As usual, by the right-hand 


ee 


> As we shall see in Sees. 30 and Si: if the functions ¥, ate continuous in all their areu- 
ments and satisfy a Lipschitz condition of the form (4.12), then existence and uniqueness 
can be proved by the method of successive approximations, just-as in the case of a single 
equation jy” = f(x, ¥). 
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and left-hand derivatives of the function 2(x) at.the point 4, we mean the 
quantities 


c(x + hr) — z(x) | . 2x +h) — 2fx 
D,2(x) = lim AGS ame .  D,z(x) = lim sx t=) 
i~O A i-0 
u>o A<O 


respectively. When it does not matter which of these derivatives is used, 
we will drop the subscripts r and /. It is easy to sce that if the derivative 
z'(x) exists, then so do both D, |z(x)| and D, |z(x)|, and moreover 


ID, le) 1 = 12, 12@))] = l2'@)], 


or more concisely 
ID iz) = le @). 


Taking all this into account, we use (4.11) to deduce from the identities 


d¥(x) ae eds 
Se ee See BN, oe ae eT 
dx 
dye(X) _ é3 
aie WO cccey Ce) Li dee 
dx 


that 
LD] Fi) — FOOT = LABS F100), F)) — SFL, yFOx), ..., y*QOI]. 


on 
<, a Z [FCx) ~~ oI), 
| be] 

and hence 


| D ») LF(x) — yF(x) i < no( Si, (x) — y% “(s)] 
to] vase] 

r (4.14) 
ules ap Pex) — cal). 
1 


The last step is possible only if 
te: 
“~ ' ™ 
> 1FCx) Ne (x)| > 
oe | 
which, in particular, is the case for x = x,. Setting 
Wi 
2 LC) — yr (x)] = 2(x), 
— 


we now Construct the solution of the equation 


OP 6s 
— = (4 + l)o(y) 

dx 
satisfying the condition y(x,) = 2(X) = z,. Such a solution exists and 


1S Li re % i. 2s 
nique (see Sec. 4), and its graph approaches the negative X-axis 
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asymptotically without ever intersecting the x-axis. By construction, 
the curves z(x) and y(x) intersect at the point (x,, z,), and the inequality 


IPQ] < (@ + Neola) = (a + DeobOy)] = yy) 
immediately implies the existence of an interval (x, — <,x,), = > 0 on 
which 

2(x) > y(%). 
But this inequality holds for every <, if 0 << < x,, since otherwise we 
immediately arrive at a contradiction by choosing < to be the largest 
value compatible with the inequality. In fact, we would then have 
D,z(%s) > y'(2) = (8 + Nol yaa)] = Cn + Dele(n)]. 
since z(x) > y(x) to the right of the point x,, while on the other hand, 
since Z(X2) > 0, (4.14) implies the contradictory inequality 
Dz(x2) < (n + Nole(s)]. 
It follows that 
2(x) > V(x) > 0 

for all x in the interval [x9, ¥,]. In particular, z(xy) > O contrary to (4.13). 
Thus there cannot be two distinct solutions of the system (4.4), and the 
proof is complete. 

THEOREM 3 (implications for a system of higher-order equations). 
Given a system of differential equations 


i111 {Mah 3 
(es eae guy 
Pe = f(s, Vees i me Tae naan’ ey 
dx dx a ec 
(4.15) 
ts Peers: 
solved for the highest-order derivatives of the unknown functions Ve uasas 


V,. suppose every f, is continuous in some neighborhood 4 of the sane 
ape" Nnwl ‘ {0° 
0 ™ a) 0 [eae ‘}) 
Nos Vi os 8 © 4 dymr} j se se wg Vas rr) dx™ al 


and satisfies a Lipschitz condition in all its arguments except the first 
on AY, Then on some interval [a, b) containing X 9 as an interior point 


there exists one and only one solution yx)... .. VAX) of the system 
(4.15) such that the functions 
= | inal 
qm Ay (x) P hie y,(x) 
y,(x), 0... a). 


ayn! dxtien?t 


take the values 


mt, WW f mq—t,, 0 

0 al? ' as] 0 d \ :| 
bi nanamnmnninnene ett ’ cee ine 
4 {es te # © § ani | eS ee bs a metry pee: 


Ldx 
at the point x = Xo. 
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Proof. Use Theorems 1 and 2 and the considerations of Sec. 27. 
Just as in See. 11, the solution on [a,b] can be continued in either 
direction. 


THEOREM 4 (Cauchy's theorem). If every function [(x, Vigewns Vs), 
i=1,....2 is analytic in all its arguments in a neighborhood of the 
point (Xq. V2... ..¥), then the system (4.4) has a unique solution y,(x), ..., 
y,(x) satisfying the initial conditions 


on ll 
Vy(Xq) = Ve gta an te V(Xy) eo ay 


and this solution is analytic in a neighborhood of Xo. 


Coroiiary. Jf every function fx. yi,.--,¥y), b= l,...,a fs 

analytic in all its arguments on a domain G and if every i CO Vin aiag,) 

is real for all real x, yy... ~.¥_q. then every real solution of the system 
(4.4) dying in G is analytic. 


THEOREM 5 (Smoothness of solutions). If every function fX, Vy... Yas 
i=1,...,a has continuous derivatives with respect 10 X, Vy... Vp 
up fo order p > O (inclusive), then every solution of the eee (4. 4) has 
continuous derivatives with respect to x up to order p +- 


THEOREM 6 (Dependence of the solution on parameters). Given 
a system 
dy; * 2 “ 4 ; fl 16) 
Be AN Vy ace Visas, 3), bSlesetally: Aaa 
ax 
let Cais y, be a closed domain in (x, V,,..., ¥,) space, let P be the 
open m-dimensional parallelepiped 


tu a Ws rors [ean = a 


me 


é 


(where the u° are positive constants), and let G = Gay <3 .y, *% P. Suppose 
every fUX Vy. Vas Maes 2) Aas bounded continuous derivatives 
With respect 10 yy... y Vas i‘ Lee thy, Up to order p-» O (inclusive) 
on G, and satisfies a Lipschitz condition in the variables Se Vy ONG 
whose constant is a of the PAYAMELETS Byes soy Verne Then, given 
any interior a OS sincase 3), ) Of Gz, gaere ea closed interval 
AQxX <b (a < xy < b)on which the system (4. Z has a unique solution 
tee als. iy eee, ak Vy = OX, yy ees By) taking the values 
Vek yn for x = x,. Moreover, ever pen ee ts ey Ly,) has continuous 
derivatives with respect to May... phy, Up to order p on the set [a, 6) x P. 


e Tl 

CoROLLary. If every f(X, Vy, 666 Vn) de continuous derivatives with 
respect {0 X, Vy... . 3, up to order p x» 1, then the pancious yx, x 

Jlvsre sy in) satisfying the system (4.4) and equal to Bie Pie i for v= xy 
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have continuous derivatives with respect to see re ae ve up to order p> 1. 
This assertion remains true for p = 0 if the f, are such as to guarantee 
uniqueness Of the solution passing through (xX, y},..., ¥°).° 


Problem 1, Verify Theorems | and 4 in detail, and also Theorem 6 and its 
corollary. State the implications for systems of higher-order equations. General- 
ize the results of Secs. 13 and 19, and also Remark 3, p. 32, to the case of 
systems. Also generalize Probs. 8 and |1 of Sec. 12, Prob. 4 of Sec. 14 and Prob. 
5 of Sec. 19. 


Problem 2. Prove that the assertion of Prob. 8, Sec. 19 generalizes to 


surfaces of the form y = f(x,,...,%,,), but not to curves in n-dimensional 
space (7 >> 3). 


Problem 3, Let every f,(x, y;,..., ¥,) in the system (4.4) be continuous, and 
suppose every solution of (4.4) satisfying the initial data stays inside the domain 
G when continued onto the interval a <x < 5 (a <x, < 5). Under these 
conditions, prove that if the initial conditions and the functions /; are subjected 
to “sufficiently small’ changes, then the varied solution can still be continued 
onto the whole interval [a, 6], where it is uniformly close to one of the solutions 
of (4.4) satisfying the original initial data. 


Problem 4 (UH. Kneser). Prove that under the conditions of Prob. 3, the 
intersection of the plane x = ¢ (a < ¢ < &) with the “integral funnel” (the set 
of all integral curves satisfying the given initial conditions) is nonempty, closed, 
bounded and connected (i.e., cannot be represented as the union of two non- 
empty closed disjoint sets). For 2 == 2 construct an example where the indicated 
intersection Is a) a disk; b) a circle. 


Hint. Consider the intersection of the plane x =e with the set of all 
Euler lines for systems with “neighboring” right-hand sides satisfying the given 
initial conditions, where the projections of the segments of the Euler lines onto 
the x-axis all have the same length. 


Problem 5, Generalize Prob. 4 of Sec. 11 to the case of systems of the form 
(4.4), and interpret the result geometrically (instead of a direction held, we have 
a field of “cones of directions,” where the cones are of a special form). Go from 
cones of a special form to arbitrary convex cones. Generalize the results of the 
preceding problems to systems of this kind. 


Problem 6. Give sufficient conditions for existence and uniqueness of 
solutions of an infinite system of equations 


dy; 


dx _ fix. Vie Vos cand ), i I S eb a 
ay 7 


Involving infinitely many unknown functions. 


* See the theorem of See. 19 and the remark on p. 38. 
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30. The Principle of Contraction Mappings for a Family of 
Vector Functions 


An ordered array 
Ax) = (r(X), «6 GAC) 


of 7 functions 9,(x), ..., 9,(x) is called a vector function (in 1 dimensions),’ 
and the separate functions 9,(x), ..., @,(¥) are called the components of 
o(x). By o(x) + o(x) we mean the vector function with components 
o,(x) & 6,(x),..-. 9,(¥) + 9,(x). A sequence of vector functions 


a(x) ba (oP ix), a ae o.'(%)), poo a 2, eae 


is said to converge (uniformly) as k —> 9 to a vector function 


AX) = (91(X), «+ > nl) 


if each sequence of components ¢9{*)(x) converges (uniformly) as k — © to 
o{x), f= 1,..., 0. Clearly, a necessary and sufficient condition for a sequence 
of vector functions @ (x) 10 converge uniformly to @(x) as k — 90 is that the 
numerical sequence 


n 
> sup |e;"'(x) — eCx)| 
i=1 xé}y 
converge to zero as k —> «& (why ?). 
A vector with only one component is often called a scalar. We now 


generalize the principle of contraction mappings proved in Sec. 15 for a 
family ® of scalar functions: 


| THEOREM (Principle of contraction mappings for a family of vector 
functions). Let D be a nonempty family of vector functions o(x), each 
defined on the same set E, satisfying the following conditions: 


1) Every function % is hounded, in the sense that 


ps 
> sup |o(x)| = M, < co 


il 2éE 
(in general, the bound depends on 9); 


2) The limit of any uniformly convergent sequence of functions in p 
also belongs to ®; 


3) There is an operator A defined on ® carrying every function 9 uN 


D into another function Ao = ({Ao\s,..., {Agy,) in D: 


OS ent 


7 “yt fe + f% wa} ae i . ° ; ac * al 
eer on, we shail deal with vector functions which are best thought of as “column 
: ate, : ae s . 
vectors” rather than “row vectors.” 
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4) The inequality 


tel Pi 
> sup Ae). — (49Q)}:1 <r D sup |o,(x) — 9,0) (4.17) 


holds for every pair of functions 9, 9 in Y and for some fixed m 
in the interval OQ < m <4. 


Then the equation 


Q = Ad, 
has one and only one solution in the family ®. 
Proof. The proof is word for word the same as in the scalar case 
(see Sec. 15) if we make the substitutions 


o, > oft? SUP [2,21 — Pel —> D sup [oY — of], 


n 
sup |Ag — Ag,_,|—> ¥ sup |{Ast, — [Ao PY, 
fs} 


and so on, bearing in mind the italicized assertion preceding the statement 
of the theorem. 


Remark. The geometric interpretation of the theorem is the same as in 
the scalar case (see Sec. 16), except that now the “points” are vector functions 
(curves) and the “‘distance’’ between two points 9 = (%,..-,9,) and 
o = (0,,...,9,) is given by 

Hu 
> sup [9; — 9l- 
t-2] 
Problem. Prove that the principle of contraction mappings remains true if 
the inequality (4.17) is generalized to 


F(sup |{Ag}, — {49}, 1,..., sup {ao}, — 42h) 
< mF(sup lo, — 9],.... Sup le, — Sal), 


Where F(r,,....%,) iS & continuous nonnegative homogeneous function of 
degree 1, defined for t, > 0,....%, > 0 and zero only at the origin. Give 
examples of such functions. 


31. The Method of Successive Approximations for 
a System of Differential Equations 


THEOREM. Suppose the functions fXiVi.- +: Va GS], JH 


appearing in (4.4) are continuous in x on a domain CF UPEN, Vie eee iis) 
space and satisfy a Lipschitz condition in the variables Vy..... Vy On 
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every closed bounded domain G' contained in G. Then, given any point 
(Xoo Pe...) in G, there is a closed interval la, b] containing xy as an 
interior noint on which the system (4. ka has a unique solution y,(x), .. a 
¥,(4) faking the values Vee. ee My for X = Xo. 


Proof. First we note that if such a solution y,(x),...,3,(x) 
exists, the — of the identities 
day(s) ae “ape eae , 
S) — fe, VS loxas-¢ aed: P= 1,...,1 
az 
between x and X, gives a system of integral equations® 


= 


nod =P +P AE i)... Ede, PS bee (4.18) 


39 
Thus every solution of the system (4.4) passing through the point 
(Xo. Vie },) satisfies (4.18). Conversely, every continuous solution 
¥i(x). 2. ¥,CN) of the system of integral equations (4.18) satisfies 
the system of differential equations (4.4) and the initial conditions? 


¥(Xo) ss Sf ee ¥y(Xo) = ue 

In fact, the initial conditions are obviously satisfied by the left-hand 
sides of (4.18), and moreover differentiation of both sides of each 
equation (4.18) leads at once to the system (4.4), provided the differ- 
entiation is legitimate. But the right-hand side of each equation (4.18) 
is Clearly differenuable for continuous y,(x),....¥,(x) [being an in- 
definite integral of a continuous function], and hence so is the left-hand 
side. Therefore. instead of proving that the system (4.4) has a unique 
solution taking the values 72... .. y for x = xy on some closed interval 
la, b) (a < xy < 6), we shall now prove that the system of integral 
equations (4.18) has a unique continuous solution on [a, b]. This 
will be done by applying the principle of contraction mappings for 
a family of vector functions. 

With this in mind, let G’ be any closed bounded domain contained 
In G with (vo, »?,..., 8) as an interior point, and let AW be the least 


upper bound of all the MINCUONS T(x, Tees. ce Val ik eee tO 
G’, Draw 2n planes 


‘ U : i . % 7 # 

Y;— ¥, = EM(x — Xp), re (4.19) 
through the point (x5, yf... .., 9), and then two planes x = a@and x = 
which together with the planes (4.19) form two pyramids ?, and Ps 
completely contained in G’, with common vertex at ae tineeds yi 


g 
Each integrand is continuous by an argument like that in footnote 9, p. 39. 
* Sce footnote 10, p. 39. 
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[later in the proof, we shall assume that a and 6 are sufficiently close 
tO Xy]. Now let £ be the interval [a, 6] and ® the family of all continuous 
vector functions’? 9(x) = (9,(%),...,%,(x)) defined on [a, b] whose 
graphs lie in G’. Then conditions | and 9 of the principle of contraction 
mappings are obviously satisfied. Moreover, Iet A be the operator 
carrying 9 into Ag, where 


(Ao), = yl + |" yf cane Co eee ml <4) a | 2 os) A 
ety 


[t is clear that @*(x) = Ag(x) is itself defined and continuous on [a, 6], 
and in addition satisfies the initial conditions 9¥(X9) = yq...., 2*(X) = 


yj. Moreover, the curve y, = of(x), f= 1,...,# cannot leave the 
pyramids P, and P,, and hence cannot leave G’, since 


FAN Ce Reo) | ae) a ee eee 
implies 
a(x) —- Pla M|x—xl, i=l,....m. 


* 


Therefore condition 3 of the principle of contraction mappings is 
satisfied. To verify condition 4, we use the fact that every /; satisfies a 
Lipschitz condition in the variables y,,..., ¥, lo write 


Y sup [fg he — {4 oH 


i=] 


7 


n : - : - ; : 
= Soup] (9? +] AE onl) + 2nl2)] 4) 


r=] : 


- 


— (08 +[F AE. 22)... 3.21 2) | 
0 ‘ 


* aw 


an “zt ke wat sad = baal a “x tee 
Pm ¥ sup | AIS ey 24 SH Slee iets Bal) dé | 
f=] ome, | 
nt "+ . 
< KY sup | fled’) — G+ +++ + bee) — S21] 
ies} al 
i a 
< Kn(b — a) S sup |; — 9,| = md sup |e, — Gal. 
Ft isl 


Where Kn(b — a) = mand m <1 if [a, 6) is sufficiently small. Therefore 
we can use the principle of contraction mappings to deduce the existence 
of a unique continuous solution of the system of integral equauons 
(4.18), or equivalently the existence of a unique solution yy(x), . ... V(X) 
of the system of differential equations (4.4) satisfying the initial conditions 
MiG ay cod ee 


*” A vector function is said to be continuous if all its components are continuous. 
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Remark 7. All the remarks made at the end of Sec. | 1 remain in force here 


Remark 2, Suppose the domain G contains a “strip’e <x <d,—-me< 
yp < Of = 1,...,n (where cand dare finite), and suppose the functions f 
satisfy a uniform Lipschitz condition in the strip. Then, just as in Remark i 
p. 42. it can be shown that if the “zeroth approximation” (i.¢., the function 
o, of Secs. 14 and [5) is continuous on [a, 8], the successive approximations 
converge uniformly on [c, d] to a solution of the system (4.4). | 


| Instead of studying singular points, singular curves and more generally, 
singular surfaces for systems of differential equations (as in Chap. 3), we now 
turn our attention to systems of Jinear differential equations. 


5 
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32, Definitions. Implications of the General Theory 
of Systems 


A system of differential equations is said to be /inear if every equation of 
the system is linear in the unknown functions and their derivatives. As 
shown in Sec. 27, every system of differential equations is equivalent to a 
system containing only first derivatives. Therefore we shall deal primarily 
with first-order linear systems, confining ourselves as before to the case of 
equations which are solved with respect to the derivatives. The general form 
of such a system is 


dy: 0 | ; 
ot > a,(xy; 4- f(x), hon Beer ee | (3.1) 
ax. jet 


which can be greatly simplified by using matrix notation. In fact, suppose we 
introduce the 2  n coefficient matrix 


A(x) = |]a,;(X) || 
and the 7 x L| column matrices (or vectors) 
FQ 
for= | 


yy) 
y2(x) 


y(X) = |] 


Inf) Vn (x) | 
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(cf. footnote 7, p. 98), which are handled by the usual rules familiar from 
linear algebra. (In particular, the derivative of a matrix or a vector is found 
simply by differentiating its components.) Then (5.1) takes the form 


dx 
as can be verified by calculating the matrix product in the right-hand side. 

From now on we shall assume that the functions a; (x) and f(x) are con- 
tinuous in an interval a < x < b, where this interval may become infinite at 
either end (or both). The right-hand sides of such systems have bounded 
derivatives with respect to every y, and hence satisfy Lipschitz conditions in 
every closed interval [a,, b,] contained in (a, 6). Therefore it follows from the 
theorem of Sec. 31 that one and only one integral curve of the system (5.1) 
passes through every point (xg, ¥°,..., p2) of the strip a<x<b in 
(X,¥1....,¥,) space. In fact, the theorem is immediately applicable to every 
parallelepiped of the form 


= A(x) 4- f(x). (5.2) 


a+ecqcxch—e, —-M<ay, < M, ee ara 8 


Where < > 0 is arbitrarily small and Mf > 0 arbitrarily large. But then the 
theorem also holds in the entire strip a < x < b. 

Every solution of the system (5.1) can be extended onto the whole interval 
(a, 6). This fact follows at once from the next to the last paragraph of Sec. 31, 
as applied to the closed interval [@,, b,] contained in (a, b). Thus the functions 
YiQ), -- +s Vax) constituting the solution of the system (5.1) may go to 
infinity as xa or x > b. 

If every f(x) = 0, we call the system (5.1) homogeneous [in this case, 
f(x) = 0 in equation (5.2)]. Otherwise the system is called nonhomogeneous. 


Problem 1. By the norm !'4 i) of a square or rectangular matrix is meant the 
sum of the absolute values of its elements. Prove that 


id + Bil < Al + WB, 
iCA ll = Jel {lA fl (¢ a number), 
NAB < PAL FBI. 


Prove that if the elements of a matrix 4 depend on x and have a right-hand 
(left-hand) derivative, then A(x)! has a right-hand (left-hand) derivative 
D. | A(x)'}, where 

ID, JACx) I] |< DAY), 


_ Problem 2. Using the preceding problem, estimate |D, |!y(x)|| | where x) 
1S the vector solution of equation (5.2). Using this result and Prob. 7, Sec. 12, 
estimate the rate of possible growth of the solution as x --a@ or x -- b. For 
example, 


CD. < Uylegdii exp |[* caceyy de) + [* ify lexp {[*ra@yi asl ds. 
* JO «3 


: Pa ‘ 
ie J 
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Problem 3, Prove that if all the functions a,,(x) and f;(x) can be expanded in 
Maclaurin series with radius of convergence no less than R > 0, then every 
solution of the system (5.1) can also be expanded in a Maclaurin series with 
radius of convergence no less than R. This is proved in the same way as the 
analogous assertion in the remark on p. 54. Choose the same majorizing 
function for every a;;(x) and f,(x). 


33. Basic Theorems for Homogeneous First-Order Systems 


By a linear combination of m solutions 


yy (x) vr (x) 
ye'(x) ys'(x) | 
yMPa=]o od. P= | (5.3) 
ye) dies 
of a homogeneous linear system, we mean a vector function of the form 
m 
3 Ca"), 
k=1 
where C,,..., C,, are constants. In particular, if every C, = 1, we speak of 
ihe sum of the solutions, while if C, = 1, C, = —1, m = 2, we speak of the 


difference between. the solutions y™(x) and y(x). 


THEOREM 1. A linear combination of solutions of a homogeneous 
linear system is also a solution of the system. 


Proof. Let y™(x),..., p(x) be m solutions of the homogeneous 


linear system 
dy 


a¥ Ave y =0 (5.4) 
dx 
or | 
“yt _ Sauy; =0, a! Perey (5.4") 
aX j=1 
If y is replaced by 
> Cyy(X) 


k=1 


in (5.4), the left-hand side becomes 


se Cyy™(x) — AQ) S Cy™() = x\c 


$c LPO) _ go9y% 9], 
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But every function (5.3) satisfies (5.4), and hence 

dy"(x) (ky ae 3 

= — A(x)y""(x) = 0, oa Ceram 7 


dx 
It follows that 


eu a‘ as ie Ate: 
aed SCA) — A(x) >, Cy (x) a 0, 
AX k=] ke] 


as required, 


DEFINITION. The m vector junctions (3.3) are said to be linearly 
dependent if there exist constants C,,..., C,,, not all zero, such that 


mM 


> Cy"! (x) = 0. 
k=] 
Otherwise, they are said to be linearly independent. The-determinant 
Wes. GOR. waar ati 
yO) yyy) yy (x) 


. My) yyy ne. ley 
Wey |i) YO) ye) 


* 


ya) ya) oy) 


is called the Wronskian of the set of vector functions 


yx yo 
yy(x)) yy?(x) 
ex) 4 : areas v(x) -_ . | (5.5) 


Yn (x) v(x) 


THEOREM 2. If the functions (5.5) are linearly dependent, then their 
Wronskian vanishes identically. 


Proof. This result is an immediate consequence of a familiar 
theorem of linear algebra. 


THEOREM 3, Suppose the functions (5.5), with Wronskian W(x), 
are solutions of the system (5.4). Then the functions are linearly dépendent 
if W(x) vanishes at any point x = Xy. 


Gependent, and hence we can find constants Ce ipareers Shy yams 60)! all 
zero, such that 


Proof, If W(x) = 0, the vectors y(x,), 02. , (xy) are linearly 


n 
Cee ARE pee ' 
> CLV Oa) = 6, 
Ki] 
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Now consider the vector function 


p(x > Cyr) 


ex] 


This function satisfies the system (5.4), by Theorem 1, and equals the 
zero vector. But by the uniqueness theorem, there can be only one 
function which satisfies the system (5.4) and vanishes for x = x,, and 
this function is obviously the function which vanishes identically. 
Therefore 


n 
2 Cy) == (), 
and the theorem ts proved. 


CoroLiary. Jf the Wronskian formed from the solutions (3.5) of 
the system (5.4) vanishes at a single point, then it vanishes identically. 


Proof. By Theorem 3, the functions (5.5) are linearly dependent, 
and by Theorem 2, their Wronskian vanishes identically. 


Remark. Theorem 3 breaks down if the functions (5.5) are not 
solutions of a system of the form (5.4) with continuous coefficients. 
For example, the Wronskian of the functions 


0 0 
vanishes identically, although they are linearly independent. 


Deriniton. A set of n linearly independent solutions of the system 
(3.4) is called a fundamental set of solutions of (5.4). 


THEOREM 4. Fundamental sets of solutions exist. 


Proof. Choose n®? numbers b{) such that 


(1) (2) : (7) 
by De by 
{1) (2) (n) 
bo bs Beye a b., god 0 
a) pe) 
b,, v, be 


e.9., the numbers 
(0 (oO for isk, 
i lt far eek. 
Now form == 1 solutions (5.3) of the system (5.4) satistying the 
conditions 
ota er he as = p{*), i ke oa ie L, Po er Y nN, 
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where xX, is any point in the interval (a, b). Then the Wronskian of these 
solutions is nonzero for x = Xp, and hence the solutions are linearly 
independent, by Theorem 2. 


THEOREM 5. If the functions (5.5) are n linearly independent solutions 
of the homogeneous linear system (5.4), then every solution y(x) of (5.4) 
can be represented as a linear combination of these solutions with suitably 
chosen constants i.é., 


y(x) = > Cry"). 
k= 


Equivalently, the general solution of (5.4) is a linear combination with 
arbitrary coefficients of any fundamental set of solutions. 


Proof. Let the vector function y(x) be any solution of (5.4), taking 
the value y(x,) for x = Xp. Since W(x.) 4 0, the vectors p(x), ..-> 
y'™(xy) are linearly independent, and hence y(x) can be represented as 
a linear combination of y™ (Xx), .. «5 y((%): 


n 


y(%o) = > Ce v"(x0). (5.6) 


k=1 


Now consider the vector function 


P= > Gy). 
k=] 


By Theorem 1, y*(x) satisfies the system (5.4). But on the other hand, 
y*(x) takes the same value as y(x) for x = X 9. Therefore, by the unique- 
ness theorem, y(x) = y*(X), Le., 


y(x) = Cry"), 
k==1 
and the theorem is proved. 


Problem 1. Let Y(x) be a square matrix whose columns consist of any 1 
solutions of the system (5.4). Show that such a matrix satisfies the equation 
dY AG) Y. 
dx = x) , 
where either det Y(x) = 0 or det Y(x) # 0. In the latter case, Y(x) is called a 
fundamental matrix of the system (5.4). Show that 


a) If Y(x) is a fundamental matrix, then the general solution of (5.4) is of 
the form y = Y(x)C, where C is any constant vector, 

b) The solution of (5.4) satisfying the initial condition y(%) ~~ y" is given 
by y= YQ)". 
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Problem 2. Find all the solutions of the system 
* NA ne ? ' 
XY, = 2yy — Yay 
« af Slitie ' ss * 
se ides 2 oa 
Show that 
a) If the initial conditions are specified for x, + 0, the solution exists and is 
unique on the entire real axis, but if xy = 0, the solution exists only if 
2y? — yp = O and is not unique; 
b) The Wronskian of every pair of linearly independent solutions equals 
ex, where ¢ 4:0. 


How can the fact that the Wronskian vanishes at only one point be reconciled 


with the corollary to Theorem 3? 


Problem 3. Solve the system 


Show that a solution exists on the entire real axis if and only if the initia! data 
satisfies the condition y] = 2y2. Show that the solution is then unique. 


Problem 4, Find a fundamental set of solutions and its Wronskian for the 


system 


ny 
y= Dax; i=1,....4, 
jal 


where the functions a,(x) are all continuous on the interval (a, 8). 


34. An Expression for the Wronskian 


THEOREM. Suppose the n functions (5.5) are solutions of the homo- 
geneous linear system (5.4), with Wronskian W(x). Then the relation 
between the values of the Wronskian at the points x and Xo is given by 
the expression 


W(x) = W (x,) exp| \: fai) + °° + a, (8) dg). (3.7) 
Proof. By the rule for differentiating a determinant, we have 
dx dx — dx yi yar yo 
W(x) = i ie oe i ee ee 
i ee ae We ee O. . dy) dy) _ dy | 
i A ee ae dx dx dx 


insane eerie 
‘Found by Abel in 1827 for second-order equations, and by Liouville and Ostro- 
gradski in 1838 for the general case. 
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Moreover, according to (5.4’), 


d (%) n 
ee = Xai yyy”. 


Substituting these expressions for the derivatives dy{’/dx into the 
formula for W’(x), and using the fact that the value of a determinant 
does not change if the elements of a row are multiplied by any number 
and then added to any other row, we find that 


fee? aay See sql 
W(x) = yz” ye vee yh 
Yn ad aed yo 
yd yo yoo 
oe SE ye i te yy 
an,y' en ce a,,y? | 


or 
W'(x) = ¥ ano), 


Integration of this differential equation gives (5.7), as required. 


Remark. This theorem immediately leads to another proof of the corollary 
to Theorem 3. 


35. Formation of a Homogeneous Linear System from a 
Fundamental Set of Solutions 


We begin by noting that a set of n vector functions y™,..., y@ with 
continuous first derivatives may not be a fundamental set of solutions of a 
system of the form (5.4) with continuous coefficients. In fact, according to 
Theorem 3, p. 106, a necessary condition for this to be the case is that the 
Wronskian of the functions y™,..., y vanish nowhere. We now show 
that this condition is also sufficient: 


THEOREM. Let y™,..., y ben vector functions of the form (5.5) with 
continuous first derivatives, and suppose the Wronskian of y™, ..., y'" 
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is nonvanishing. Then y™,..., y"™ is a fundamental set of solutions 
of a unique homogeneous linear system of the form (5.4). 


Proof. Consider the following system of hornogéneous linear 
differential equations satisfied by 7 unknown functions y,(x), ..., y,,(x): 


1) ek fn) 
MOV yy" 
Vo Rae eee io 
a) (n) ma foe mt 
* i] a »_ «* fi 
Va y n ¥ n 
(1) (ni) 


dy, dy; ad J; 


dx dx dx 


It is easy to see that any of the functions (5.5) satisfies these equations. 
Moreover, these equations can all be solved for the derivatives dy,/dx 
(i= 1,...,a), since the Wronskian of the functions y'?,..., yf”) is 
nonvanishing by hypothesis. To prove the uniqueness of the resulting 
system, we note that according to Theorem 5 of Sec. 33, every solution 
of the system is a linear combination of any fundamental set of solutions. 
But specifying all the integral curves of the system uniquely determines 


the direction field, and hence the right-hand sides of the equations 


oy a 
= = 2 4;,(%)y is 


since the coefficients of a linear form are uniquely determined by its 
values. 


Problem. Given a set of m continuously differentiable functions of the form 
(5.3), where #: <1, show that this set can be enlarged (by including new 
functions) to make a fundamental set of solutions (5.5) of some system (5.4) 
with continuous coefficients if and only if the rank of the matrix (5.3) equals 
at every point of the interval (a, 8). 

Hint. First construct the system (5.4) in a neighborhood of any point of 
(a, 5), 


36. Implications for Equations of Order n 
According to See, 27, the ner: linear differential equation 


d"y faa a 


Agar spree + Oyng 


x n-2 


dy | aie 
-+ Lh a(x) + aglx)¥ (5.8) 
ix dx 
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is equivalent to the homogencous linear system 
dx 
dx 
(5.2) 
dx 
dx 


cag Vn-1w 


= Ag(X)Vo + ay(X) Vy + ao(X) V2 + Opa) Vn 


where y, denotes y and y, denotes the kth derivative of y. 
Concerning the equation (5.8), we now make the following observations: 


1. According to Sec. 32, if the coefficients a(x) are continuous on the 
interval (a, b), then given any xX, in (a,b) and any set of numbers 
y®, yo,..., V9_4, there exists one and only one solution of (5.8) whose 
derivative of order i(i = 0, 1,..., — 1) equals y} at the point x = Xp. 
This solution exists on the whole interval (a, b). 

2. It is obvious that if the functions y™(x), ..., y0(x) satisfy (5.8), then 
so does any linear combination 


mH 


Gy) (5.10) 


rad 
with constant coefficients. 


3, Given m solutions 
Ak) 
¥o (x) 


WO) 


Hl ¥nea(X) 
of the system (5.9), suppose there exist constants C,,..., C,, such that 


m 


> Cro (x) = (5.11) 
k=] 


for all x in the interval (a, b). Then clearly we also have the identities 


YC.) 20 be ie 4 aaa 
k=% 


2 Here the functions y*(x) are scalar (ordinary) functions, and not vector functions as 
before. 
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Therefore we call the solutions y(x),.... (x) of equation (5.8) 
linearly dependent if there exist constants C;,...,C,,, not all zero, 
such that 


MW 


> Cyy"(x) = 0. 
ke] 


4, Bearing in mind that y,(x) is the derivative of order i of y(x), we see 
that the appropriate definition of the Wronskian of # functions 
Noe YOO IS OW 


1) A} ar in} 
} J 3 
ay aly? ay" 
dx dx be 


ras ae (aati _ ety?! 


dx n—-I Her dx! 


Thus Theorems 2, 3, 4 and 5 of Sec. 33 (and also the corollary to Theorem 
3) all continue to apply for the equation (5.8), provided we interpret a 
linear combination of solutions to be a sum of the form (5.10) and the 
Wronskian to be a determinant of the form (5.12). 


5. There is only one nonzero coefficient on the principal diagonal in the 
right-hand side of the system (5.9), namely the coefficient @,,_,. There- 
fore formula (5.7) now takes the form 


W(x) = W(x) exp {]. @.a(@) 4» 


Remark. Suppose the determinant (5.12) vanishes identically, but the 
functions v(x), ..., (x) are not assumed to satisfy an equation of the 
form (5.8) with continuous coefficients. Then it cannot be concluded that 
the functions are linearly independent, i.e., that they satisfy an identity of the 
form (5.11) with #2 =, where the coefficients C, are not all zero. For 
example, the functions 


yr@=x,  pQ=xbt  (-lax<D 
are linearly independent, although their Wronskian vanishes identically. 
Problem 1, Show that if 


. diyh®) 
if me aries coil 
ees id 


eee eon ee es eee 


% 


dx? 


then the system constructed in Sec. 35 is equivalent to a single ath-order 
equauon, 
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Problem 2. Suppose the solution y(x) of equation (5.8) has infinitely many 
zeros in an interval [a,, b,], where a <a, <b, <b. Prove that y(x) =0 on 
(a, b). 


Problem 3. Solve the equation 
y Pay 
by expanding y in a Maclaurin series. Establish the convergence of this series 
(cf, Prob. 3, Sec. 32). 


Problem 4. Prove that if the Wronskian (5.12) vanishes identically for 
analytic functions yx), ..., y'”(x), then the functions are linearly dependent. 


Problem 5 (G. K. Engelis). Let yx), ..., yx) [m <n] be a system of 
linearly independent functions on the interval (a, 6), where each yx) has n 
continuous derivatives. Prove that there exists an equation of the form (5.8) 
with continuous coefficients having the given functions as particular solutions 
if and only if the rank of the matrix 


qty h9) 
dx 


j Se Agecig ih SO sagt 1 


equals m at every point of the interval (a, 6). 


Problem 6. Prove that m functions y“™(x),..., yx) continuous on fa, 5] 
are linearly dependent on [a, b] if and only if the Gram determinant 


det ii yOc)y (x) dx 


vanishes. 


37. Reducing the Order of a Homogeneous Linear Equation 


Let the functions 
Dien ©.) Reeeer alias 29) (5:19) 


be m linearly independent solutions of equation (5.8) with continuous 
coefficients on the interval (a, b). Because of the linear independence, none of 
the functions (5.13) is identically zero (why ?). Let xy be any point of (a, 5) 
such that y(x9) 0. Since y'(x) is continuous, there is an interval (a, ,) 
containing X) in which 

Ly) > 0. 


Suppose we change the unknown function in equation (5.8) by setting 


yx) = y(x)z(x). 


® See the problem at the end of Sec. 35. 
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Then it is casy to see that 2(x) satisfies an equation of the form 


jars ue 1. [r-2 z de ; 
dx® ahs) § yo) ro oe “+ Gy_1(%) ae + ag(x)z, 


where the coefficients — are continuous on the interval (a,, 6,). Since 
yM(x) satisfies (5.8), the differential equation for z has the solution 


z(x) ae 
which in turn implies 
ag(x) = 0. 
Now let 
dz _ is 
ax 


Then the function y* satisfies the following homogeneous linear equation of 
order 7 — | with continuous coefficients: 


oy = aa is gs * 
ix) — RET Ses 12 ie a, n—2 (Xx) dyt3 Spit A ee ay (xyyye, (5.14) 
Moreover, it is easy to see that each of the functions 
(+1) 
= d y a : Se ie 
ye (x) = — ae nT ( male) A i=1,...,m-—1 (3.15) 
dx (x) 


is a solution of (5.14) on the interval (a,, b,). The functions (5.15) are linearly 
independent on (a, 6,). In fact, suppose there exist constants Cy,.., Cai, 
not all zero, such that the identity 


Se £8 is re) =e 
jay Nex) 


mT 


Sy Carry) ne Cys) = (0 


tivatonat! 


ist! 
where Cis a new constant. Therefore the functions (5.13) are linearly depend- 
ent on (a,, 6,). But then some nontrivial linear combination (5.10) vanishes 
identically on (a), 4), and hence on the interval (a, 5) as well, by the unique- 
ness theorem. In other words, the functions (5.13) are linearly dependent 
on the entire interval (a, 6), contrary to hypothesis. This contradiction shows 
that the functions (5.15) are linearly independent on (a, 4)). 

Starting from these nv: — | linearly independent solutions of (5.14), we 
can carry out the same procedure, applied this time to (5.14) instead of (5.8). 
This gives an equation of order m1 — 2 on some interval (a2, 3) contained 
in (a, ,). Repeating this argument, we finally arrive at a homogencous linear 
equation of order 2 — ni on some interval (@,,. 8.4): 


holds on (a, 6,). Then 
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Problem 1, Prove that with the assumptions made in. this section, any 
interval (d, 6) where @ < d@ <b < bean be decomposed into a finite number of 
open intervals in each of which the order of the equation can be reduced to 
a 992, 


Problem 2. Find the general solution of the equation 
(2v — 3x4)y" + dy’ + 6xy = 0. 
Hint, One solution is a polynomial in x. 


Problem 3. Give a simple method for reducing the number of equations in 
the system (5.4), starting from a nonzero particular solution. 


38. Zeros of Solutions of a Homogeneous Linear 
Second-Order Equation 


Consider the differentia] equation 
ou ED a(x))”" =e b(x)y ee 0, (5.16) 


Where the functions a(x), a’(x) and &(x) are continuous and bounded. In this 
section, we shall be concerned with the problem (of great practical importance) 
of how often a solution of (5.16) can vanish. Making the preliminary sub- 
sutution 


¥(X) = 2(x) exp \~ 


7: 


[Face ag], 


we reduce (5.16) to the form 


x" -+ B(x)z = 0, (3.17) 
where the function 
ana 
Bae ae aah 
4 ? 


is itself continuous and beunded. Since a(x) is bounded, the functions 2(X) 
and 1(x) vanish at the same time. The following is the basic result of this 
theory: 


THEOREM (Sturm’s theorem). Given to differential equations 


24(X) -+ B,(x)z,(x) = 0, 23(X) -i- Ba(x)z.(x) = 0, 


Suppose B(x) and B(x) are continuous on a closed interval a< x <b 
and satisfy the inequalit y! 


Bx) co» B(x) (a = x =D). (5.18) 


Let xy and xy (a< Xy <i Xy <b) be any two consecutive zeros of 4 
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solution z,(x) 32 O of the first equation! Then every solution z(x) of the 
second equation has at least one zero between x, and X., provided that 
za(x) does not vanish for x = x, and x = Na. 


Proof. Let 2,(¥) be a solution of the first differential equation and 
fe 2) a solution of the second. Subtracting the second equation multiplied 
by z,(x) from the first equation multiplied by z.(x), we obtain 


zi(x)zu(x) — 2,(4)22(8) = [Bo(x) — By(x)]2,(¥)22(x). (5.19) 
Noting that 


4 es aes d we , a 
Zi(N)Za(NX) — 2,(X)24(xX) = ae (zi(x)z.(x) — 2,(x)z5(x)], 
dx 


we integrate the identity (5.19) from x, to X2, using the condition 


2,(%,) = 2,(%_) = 


This gives 
2!(Xu)Za(Xe) — 2i(%;)z0(Xx ye [Bo(x) — By(x)]zu(x)zo(x) dx. (5.20) 


By hypothesis, x, and x, are consecutive zeros of 2=,(x), and hence =,(x) 
does not change sign between x, and x,. Since 2,(x) satisfies a linear 
homogeneous equation, so does —2,(x), and there is no loss of aes 
In assuming that z,(x) is positive between x, and x.. Clearly zj(x,) = 

zy(x2) = O, since otherwise 2,(x) would vanish identically. This oli 
with the positivity of 2)(y) on (xy, Xe) implies 2;(x,) > 0, zy(4%2) < 0. 
Suppose there were a solution z.(x) which ts nonvanishing in the open 
interval (x,, ¥.) and at one (or both) of the end points x,, x2, contrary 
to the conclusion of the theorem. Then we could assume without loss 
of generality that z,(x) is nonnegative on [x;, Xe]. But then the left-hand 
side of (5.20) would be negative while the right-hand side would be 
nonnegative, since (5.18) mplies 

B(x) — B(x) > 0. 
This contradiction proves the theorem. 
COROLLARY |. No solution of equation (5.17) can vanish in the interval 

(a, b) more than once if B(x) < 0 for all x in (a, 6). 


Proof. If a solution z(x) of (5.17) vanished for x = x, and x = X, 
(i ae ONS CD), en by Sturm’s theorem, every solution of the 
equation 2”(v) = 0 would have to vanish at least onee in the closed 
interval [x,, Xa], which is obviously impossible. 


* Als (¢) nas infinitely many zeros in the closed interval [a, 6], 1s not hard to see that 
(a, b] contains a point where both 2,Qv) and 21(x) vanish. But then 2,(%) = 0 (cf, Prob. 2, 
See. 36). 
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COROLLARY 2. Let x, and x, be two consecutive zeros of any solution 
of equation (5.16). Then every other solution of (5.16) has precisely one 
zero in the interval (x,, X2) if the ratio of the two solutions is not a constant. 


Proof. Use Sturm’s theorem with B,(x) = B(x). 


Problem 1. By comparing the solution of the equation 


OP se 
z+ ftil— zs Jz = 0 
x 


(a transformed version of Bessel’s equation) with the solutions of the equation 
y’ +y =Oory” + (1 + &)y = 0, show that the distance between consecutive 
zeros of z(x) # 0 is less than = for 0 <n < 4, and arbitrarily close to = for 
sufficiently large x. How are the zeros of z(x) distributed for n > 3? 


Problem 2, Show that consecutive zeros of every solution of the equation 
y" + xy =9 
become arbitrarily close together as x > ©, 


Problem 3. In connection with Sturm’s theorem, show that if zo(x,) = 0 and 
if (5.18) becomes a strict inequality B,(x) > B,(x) for at least one point in (15 X2), 
then the zero of z,(x) immediately after x, lies to the left of x. 


Problem 4. Suppose the inequality (5.18) holds, with z;(x,) = z)(x,) = 0 
and 2;(x,) > z3(x;) > 0. Prove that z,(x) > z,(x) on the interval from x, to the 
next zero of z,(x) after x,, while if there is no such zero, z,(x) > 29(x) for all x 
in (a, b). 


Hint. First assume that z}(x,) > z9(%,). 
Problem 5. Suppose the function B(x) in equation (5.17) is defined for 
0<a<x < o and satisfies the inequality 


+e | 
B(x) > 2 (<c = const > 0). 


Prove that every solution of (5.17) has infinitely many zeros. However, prove 
that every nontrivial solution has no more than one zero if 


I 
Bix) < Axe’ 


Hint. Make the preliminary change of variables x = e’. 


Comment. For other theorems on oscillations of solutions of (5.17), sce 
Bellman’s book.°® 


’ R. Bellman, Stability Theory of Differential Equations, McGraw-Hill Book Co., New 
York (1953). 
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Problem 6. Consider the equation 


-Bx,ay=O0 (asx <b, At << A**), 
where the coefficient depends on a parameter %, and suppose that 
a) B(x, 2) is continuous in both x and 3; 
b) B(x, 2) is a nondecreasing function of % for fixed x; 
c) For any N > 0, there exists a 2 such that 


l 


B(x, 2) < = (a <x <b) 


and a % such that 
B(x, 2) > N (a, = XS by, ac ay <= b, a b), 
where a, and a are fixed. 


Prove that there is a sequence A* <2, <2. <-+++ <4, <--> converging 


119 


to 


A** such that the equation under consideration has a nontrivial solution for 


the values 4), A2,... and only for these values, and moreover 
a) The solution vanishes for x = aandx = b; 


b) The solution for % = %, has precisely » — I zeros between a and 6. 


39, Systems of Nonhomogeneous Linear 
First-Order Equations 


THEOREM. Let the vector function (x) be any particular solution of 
the nonhomogeneous system (5.2). Then every solution of (3.2) can be 
represented in the form 

r@) =e) + 20). 
where u(x) satisfies the homogeneous system (5.4). Conversely, every 
function y(x) of this form satisfies (5.2). 


Proof. The direct assertion follows from 


dv dy do . : 
= ae SAY E a 40x | = f(x) — f(x) = 9, 
dx dx dx 
and the converse is proved similarly. 


CorOLiary. Every solution of the nonhomogeneous linear system 
(5.2) can be represented in the form 


y(x) = 9(%) y+ SC ere Ge (5.21) 


Kel 
where the functions v(x), 2... v(x) form a fundaniental set of solutions 
of the corresponding homogeneous system, and C,....C, are constants 
(uniquely determined by the given solution), Conversely, the right-hand 
side of (5.21) satisfies (5.2) for arbitrary Cy, «++. Cn 
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Put somewhat more concisely, she general solution of a nonhomogeneous 
linear system is the sum of a particular ‘solution of the system and the general 
solution of the corresponding homogeneous systeni. In other words, the 
problem of finding the general solution of a nonhomogencous linear system 
reduces to finding a par ticular solution of the system, once we know a funda- 
mental set of solutions of the corresponding homogencous system. To solve 
this problem, we now use the method of variation of constants, just as in the 
case of a single linear first-order equation (see Sec. 7). 

Thus, given a fundamental set of solutions v(x), 2... Cx) of the 
system (5.4), suppose we iry to satisfy the nonhomGsencouls system (5.2) by 
writing 

% 

ro) = FC,C0y""), (5.22) 
Pe | 

where the C,(x) are now arbitrary differentiable functions instead of constants. 
Substitution of (5.22) into (5.2) gives 


XC, “(x)y"'(x) #30, Coes) ->. A(xX)C.(x)y"(x) 


= SCuayy(s) +E Caadly (2) — ACO) 
Iks=] 


fi 
= Cx Hs). 
ae 


Writing out the components of this vector equation, we obtain 


Th 

> Goye(s) = fx), = 1. (5.23) 

ae 
which is a nonhomogeneous linear system in the unknown functions C,(3). 
Moreover, the determinant of the coefficients of these unknowns Is nonzero, 
being the Wronskian of the functions y/?(x). Therefore we can solve (5.23) 
for the functions 

CAX) =o, ¥), a Cerner | 


and then integrate, obtaining 


“? ‘ i : ' + : og a a, 

Cx) = | lx) dx + OC, = FX) + Cn (5.24) 
where the C,, are arbitrary constants which can be set equal to zero, since we 
are interested in only one particular solution of the system (3.2). In terms of 
the functions ‘I’,(x), the required particular solution takes the form 


ti 


y(Xx) = s a x yes ) 


RB a 1 


If the C,, are left arbitrary, substitution of (5.24) into (5.22) gives the general 
solution of (5.2), 
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Problem 1. Suppose the functions 
he ee oe a ee 


not only satisfy the system (5.4) in x for every fixed &, but also satisfy the 
following conditions: 

lke z) et (1 it k, 

oe ee \0 if f#k, 


Prove that the functions 


* 
Pi 


rt : 4 * wey — 
yi =| > Aye, 243 
wee k=l 


satisfy the system (5.1) and a zero initial condition y(x)) = 0 at any point x, 
where the equations (5.1) are being considered. Use the fundamental matrix 
¥(a:) [see Prob. 1, Sec. 33] to represent this solution in the form 


yy = [7 YooTY@ry@ di. 


Problem 2. So far we have selected a particular solution from the general 
solution by specifying an initial condition, i.c., by specifying all the components 
of the vector function y(x) at the same value of the independent variable vw. 
However, side conditions of a more general nature can be imposed on a system 
of differential equations. For example, different components of 1(x) can be 
specified at different values of x, or values of linear combinations of the com- 
ponents can be specified at different values of x. Thus suppose the system (5.2) 
is subject to general linear side conditions which we write concisely as 


Lif] = 2h, See Tg Oh 


where the x, are given numbers and the L; are given combinations of values of 
the components of y(x) satisfying the linearity condition 


Licey a Gy] a, C.L,[y"] -- CoLfy"]: 


Prove that there are then two possible cases, depending on the coefficient 
matrix -i(y) and on the form of the functionals Z,., e., a “basic case” where the 
problem as posed has precisely one solution for any (continuous) function /(x) 
and arbitrary values of the «,, and a “singular case.” In the singular case, the 
problem as a rule has no solution at all for arbitrary f(x) and z;, but there are 
infinitely many solutions if a solution exists. Prove that 


a) A necessary and sufficient condition for the basic case to occur Is that the 
corresponding homogeneous problem [with f(x) = Q and all x, = 0] 
have only the solution tdentically equal to zero; 

b) The basic case always occurs for the system (5.2) subject to an iniual 
condition. 


Give examples illustrating the singular case. 
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40, Implications for Nonhomogeneous Linear Equations 
of Order 


The — linear equation 


a's \ ay . ' 
at = 4,3) al “Fe dy)“ dytn? a(x) # sf 4- Ag{x)}! -- f(x) 
: (5.25) 

is equivalent to the linear system 

dy 

m= he 

dx 

dy, 

os Yo, 

dx 

dy 2 

dx a Ya-b 
Gis os ) te eects 
Pet = aghx)¥o + ayCN + ayC)Yy FF ay Aa + LO. 


In this case, the system (5.23) takes the form 


Ci(x)y") + Cry gio hs Ci(x}p = 0, 
dy™ ay Pe sa 
Ci(x re + C.(x) ewe CH) ae), 
dx dx 
a a) Co (tahy@) gr-Byin 
est + C(x) Spon Ne a): 
ax” dx 


Obviously, the only function y,(x) of interest here is yy(x) = y(x). Therefore, 


having determined the f unctions C,(x), we need only substitute them into the 
formula 


3 
¥o(x) = 2, Civ (a). 


Rees 
Pears 


Problem I, Formulate conditions a prove the assertions of Prob. 2, Sec, 39 


as applied to equation (5.25). In particular, analyze the “boundary value 
problem” 


and also the analogous problem for the equation 


y 4s ¥ a f(xy. 
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Problem 2 (Analogue of Prob. 1, Sec. 39). Show that the solution of 
equation (5.25) with zero initial conditions 


yO) = yO) = = MX) = 
is of the form 


ye =|" EGC, 8 a5, 


where for every fixed = the eon G(x, 4) satisfies the corresponding homo- 
geneous equation (5.8) i in x and initial conditions 


YS) = yp") He = yy = 0, oy N(E) = 1. 


Problem 3. Using the result of Prob. 2, prove the following generalization 
of a theorem due to Chaplygin: For any xy in an interval where the coefficients 
of equation (5.25) are continuous, there are positive numbers Mp, Ay, ---. May 
depending only on x, and on the coefficients ag(x), a,(X), ..., @,-7¢X) such that 
if (x) is a solution of (5.25) with 


[(x) > 0, ¥%) = yo) = = VOM) = 0, OM) > , 


then yx) > 0 (yy Sx Saxy t+ Ay, k = 0,1,...,2—1), and moreover 
yx) oO (vg <8 <XxXy + My) if it is also known that f(x) > 0 (vy <x < 
Xo + A,). Express the least upper bound of possible values of the numbers hin in 
terms of the properties of the function G of Prob. 2. (Can & be set equal to ?). 
As an example, consider the inequality yo~y > 0, given that +(0) = 0, 

y'(O) > O. On what interval are its solutions ouaranteed to be ete How 
Ae derivatives of solutions? Prove that an upper bound for the coefficients 
letg(x)1, lay(x)],..., la@,_y(x)| can be used to deduce a positive lower bound for 
the numbers /,.. Use Hadamard’s lemma to deduce sufficient conditions for the 
validity of Chaplygin’s theorem for nonlinear equations. 


6 


LINEAR. SYSTEMS: 
THE: CASE “OP CONSTANT -COEFFICIENTS 


41. Transformation of a Linear System 


In this chapter the unknown functions »; and the functions a,(x) and 
JAX) appearing in (5.1) will be allowed to be complex, but we still assume that 
the independent variable x is real. The real part of a complex function ¢(x) 
will be denoted by Re (x) and the Imaginary part by Im o(x). Thus we have 


7 On, ey Se ead 9 _ Re O(x -+ Ax) — Re (x 
2'(x) = [lim a) = lim ( = (x) 
Az-0 LAN Agv-- 6 AX 


Im 9(x -+ Ax) — Im (x) 
ee 
Ac Ax 


— fl Re o(x) ++ pas Im 9(x), 
dx dX 


assuming that the last two derivatives on the right exist. It follows that a 
C, and (x) are complex, then 


—* ey as en 
[> Cyete)]'= ¥ C9509), 
) j 


just as for real C; and 9,(x). Similarly, we can show that derivatives of prod- 
ucts of complex functions are given by the usual rules. 
From now on we make the additional assumption that the coefficients 
a, (x) figuring in (5.1) are constant. Then (5.2) takes the form 
2 Fahtte (6.1) 
Ae Ce), 


dx 


124 
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where A isana » coefficient matrix, while f(x) and o(x) are the given and 
the unknown column vectors. In solving (6.1), our basic idea will be to 
transform the unknown vector into simpler form by using a suitable linear 
transformation. 

The linear transformation 


i 
hig TE dpe (6.2) 


j=l 
can be written In the form 


rie ae Gn 


where z = r(x) is anew unknown vector, and K is an a “ 1 matrix charac- 
terizing the transformation. Restricting ourselves to nonsingular transforma- 
lions, for which det X = 0, we can solve for v: 


te Kos 
p= =. 


Then substituting for yin (6.1), we obtain 


ae a <= AK: + f(x). 
dx 
It follows that 
“KAKO: + KSC). 
dx 
or finally 
Ue Beha, (653) 
dx 
where 
B= KAK", — 9(x) = Kf(x). (6.4) 


The system (6.3) has the same form as the system (6.1), except that the 
coeflicient matrix A has been replaced by B = KAAW'. Starting from a given 
matrix «fl, we naturally try to choose the matrix Kin such a way that 8 takes 
the simplest possible form. In courses on linear algebra, it is shown that there 
is always a choice of A such that the matrix 8 has “Jordan canonical form,” 


in which “Jordan blocks” [1,. fl ....0h. (i) -< & < 2) appear along the 
main diagonal and all the other elements are zero: 
Hy 
Pe 0 
B= : 


entry, 

+ 

oo 
~ 


POE EE aR se iinet OEY oe oe th acraeantinente di a Oe 


ndagea ts 


ne Beet 


ee 


eh OPP natn OCCT 


Selo. nSinra hor wtieasept pS 
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Every block IT, is a square matrix of order N; (1 My < it) of the form 


ia, | 
x, ; 0 i! 


where one of the roots 4, of the characteristic (or secular) equation 


of the matrix A (£ is the unit matrix of order #7) appears along the main 
diagonal, the number | appears along the diagonal just above the main 
diagonal, and all the other elements are zero. The order of the matrix I], 
equals the degree of the ‘‘elementary divisor "(A — A,)”’ corresponding to the 
root 4;. Ifa given root has several elementary divisors, each of the correspond- 
ing blocks in the Jordan canonical form has the same element along its main 
diagonal. In the simplest case, where all the elementary divisors are of the 
first degree (for example, ihis occurs if all the roots of the characteristic 
equation are distinct), all the blocks are of order 1, 1.e., the matrix B is in 
purely diagonal form. 

In what foilows, we shall find it more convenient to use Jordan blocks ofa 
somewhat different form. It is easily verified directly (the reader should do 
this) that if we set 


; a 


iu i | 


Where the indicaied diagonal contains arbitrary nonzero elements and all the 
other elements are zero, then 


RAR 


rs 


] 
1 
i= 


dy A; 
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In this way, we can make the numbers under the main diagonal completely 
arbitrary, thereby obtaining a block of the form 


hj 
Y. A; 0 
fe 
be . . : (6.5) 
0 oy 
Cony hj 


Suppose we subject all the blocks [1, to transformations of this kind, so that B 
is transformed according to the formula 


B — LBogl 3: 


new 
where 


fe 
x 


Then the matrix B,,.,. will be made up of blocks of the form (6.5), and moreover 


new 


fom | 


Buew = LKowdAKoial" — (LK ow)A(LKota) ac KAnewAl News 


t.e., the transformation to blocks of the new form can be accomplished by 
changing the transformation matrix as indicated. 

Suppose such a transformation has already been carried out, so that the 
matrix B appearing in the system (6.1) consists of Jordan blocks of the form 
(6.5). Then, going from the matrix notation to the usual scalar notation, we 
obtain the following canonical form for a system of linear differential equa- 
tions with constant coefficients: 


“de, ; 
a “ir AQ EY + g,(x), 

dx 

ol ae 

7a = yoy Aye + ga(x), 

dx 

dz, é | 

oa == Kees + M423 Ba), 


dx 


+ * 4 . * . . * 
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dz, 
HY =. _ ce -s P 
Ayden yd MEn, i Pie): 
dx 
(eae 
P 1 a - on Ji é 
ee i a Age ytl r &), walt), 
dx 
dt 
ytd _ - - ' - 
aed iy BZ apt = Age Mabe _ Sy al), 
dx 
rae 
nit 3 Sear a ms ' ~ se ; 25 
i ale aes B92, en i AS es a Zn ralX)s 
dx 


dz 

; 
“Mgina 
an C8 EL 


~. I * ~ (: ma 
€ _—— : # ; he 
4no-lnyt nel | hoe nine ' Snytnes Xx), 


dx 
eter er é3 

a Aye ype OO Eee eso) 8 

dx 
dz,» 0 a 

on — ee oe “ ~e eae - 

dx = Opens Ags pe ny 27 Sn nyehX)s 
dE nis —— - fs ani ' Y ae 

dx ve am mn ped TT Ape ye ers ae 2n—nyralX), 
dz 
— Be - me ' ee 6.6) 
dx Mne-de art Age y, = HAND: (0. 


Here the 4,, %;, 8, ..., 0, are complex numbers, and the z;, %;,....@, can 
be given arbitrary nonzero values provided they are nonzero (in particular, 
they can all be made arbitrarily small), On the other hand, the 4; are com- 
pletely determined by the given system. 

_ After the system has been reduced to canonical form, it can easily be 
integrated, In fact, the first equation of the “canonical system’’ involves only 
one unknown function z,(x). Determining z,(x) from this equation and 
substituting the result into the next equation, we again obtain a linear equa- 
lion involving only one unknown function AN. and so on. 

__ The integration of linear systems with constant coefficients will be con 
sidered in detail. staruing from Sec. 45. In Sees. 42-44 we digress to develop 
the theory of reduction of linear systems lo canonical form. 
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Problem. Prove that Lagrange’s formula for finite differences may fail for 
complex functions of a real variable. However, show that the formula 


AL f(b) — fla) = vf) 


holds for a function f(x) differentiable on a < x <b. Here a < = <b, > and 
y are real numbers which tn general depend on a, 6 and f, and 27 + vu? «0, 


42, Reduction to Canonical Form 


THEOREM. Given a system of linear differential equations 


n 


dye.. . _ 
== Sas FS heat (6.7) 
jul 


- 


dx 


with constant coefficients a,,, there exists a linear transformation 


with constant coefficients c,, and determinant |c,;| == O which reduces (6.7) 
to the canonical form (6.6), where the fictions g(x) in the new system 
are linear combinations with constant coefficients of the functions f{X). 


Proof. The theorem is obviously true for # = |. Assuming that the 
theorem is true for x — | equations, we now show that it is also true for 
n equations, thereby establishing the proof by induction. 

We begin by multiplying the ith equation of the system by x; G = 
I,..., a), Where the &, are certain constants to be determined later. 
We then add the resulting equations, obtaining 


n be = 

ae Sau? ake une ML Pee ee: eee S ryt 

> a,kV; = ADK SY, = pee k iy; 
t,j=ul ral j~t 


an identity in the y,, where % is a real or complex number. Obviously, 
a necessary and sufficient condition for this relation to be an idenuty ts 
that coefiicients of identical y, be the same in both sides, Le., that 


Comme 
i 


Fe 
VdgkeS ih. J Svcs 
ak 


Thus the &, are determined by a system of 1 homogencous linear equations 
in #2 unknowns. A necessary and sufficient condition for this system to 
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have a nontrivial solution (and this is the only case of interest here) 
is that the determinant of its coefficient matrix vanish, Le., that 


det (la,;{| — KE) = (—1)" det (AE — |a,;|l) = 0, (6.8) 


where £ is the unit matrix of order 2, Equation (6.8) ts called the char- 
acteristic (or secular) equation, and plays an important role in many 
problems of mathematics, physics and astronomy. Correspondingly, the 
matrix AE — |la;;j] is called the characteristic matrix of the system (6.7). 

Now let A, be one of the roots of equation (6.8), and let k,; (i= 
],....#) denote any set of numbers, not all zero, satisfying the system 
of equations 


> djki: Si Agky; CF == 15.330 {3D 


Suppose k,, = 0, an assumption which entails no loss of generality 
since it can always be achieved by suitable renumbering the }; (the 
corresponding transformation ts nonsingular). Moreover, let 
n 
4) es, (0.9) 


The function 2, satisfies the equation 


dz : 
= a + gil) 


Seema 


as 
where 


g(x) = 2 ky, F(X). 


We write this equation instead of the first equation of the system (6.7), 
and then rewrite all the other equations of (6.7), with y, replaced by the 
expression involving 2, j's,..., }, implied by (6.9). Such an expression 
exists, since ky, = 0 by hypothesis. This gives a new system, which we 
denote by (6.7’), of the form 


ne a 
ee AyZy ~{ z,(x); 
dx 
dy er e se sk . 
MCI mane me Qayly) =" Guo Va ae ag Vs =“ “oe -- Ai, \ -j- f(x), 
dx 
eer 
ape oi Aga dy I agg 8 A Maha oP F(X), 
dx 
Wy kg 
—— = 7} ot . j ™ ‘ ‘fk a 
= Ue] ad n2y2 i GygVy att cant. nnd n <a pes) 


dx 
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Next, assuming that our theorem has been proved for n— 1 
equations, we apply it to the system obtained from (6.7’) by omitting the 
first equation, at the same time regarding z, = z,(x) as a known function 
just like the f(x). Then there exists a nonsingular linear transformation 


nm 
ade Se pe, ae 5 sa 
i D hoya. ped ie eB 
722 
reducing the system (6.7°) to the form 
dz, 7 
—— = Aye, -F 8i(%), 
dx 
ok 
ies : 7 
= =: baty Ag VE + fa(X), 
dx 
d vy 5 : ~ 
—S== byt, -b ay yd + dayg + S/a(X), 
dx 
dy* A ae ‘ 
a by, + says cb Ady + ACN), 
dx 
Ne as ; 
ea 7 3 x 3 Zi = 24 
= b,aaZ1 Se a Vy Pa et Faye) 
dx 
Ove ae ‘ x p 
= Benth ss Ag¥ yee Snel); 
dx 
° prs Q ok . * we ? : Yue 
irae Dy ea%1 mid nye? a" AgVy a Jaya), 
dx o 
(6.7") 
Vek QA * x # 2 3 i 8 
b,, re Paves 13M yea 7 Jn 4%), 
dx 
d ey -h + : 3 : 
* re ? 1 } + ¥ ™ vk ete ; . 
— Diy. nyrl=l oe Dae ans 3 AgM yee ld 9 | per es: 
dx 
ONT ny 1 


cs CRT 
dx 

ay Ranged 
dx 

dy 

ae ee IAry 


b nome 


a ‘ 
#11 a heey git ncn: 


t+ da | Ae eS as Te +2 a(X), 


Meet ~ 


ta 


3 
i coy} H-ngewl 


~~ 
= 

” 
to 


* * : 
Poy hil os oe : Y 
mo | a No Q@ Vien e ae 0 fas Ager wa nped i , eal’), 


+ f,(3). 


Pe eons sD 
aes Bos hy aii N 
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To reduce (6.7) to canonical form, all that remains is to eliminate 
some of the f;. Since every group of equations from the second to the 
(1, + I)st. from the (jy -+ 2)nd to the (ay +- ty + Hst,..., from the 
(n, — 8, + Ist to the ath has the same structure, we shall only show how 
Lo eliminate some Of the constants dg, Og, ... . Oy4.,. Here two cases must 
be distinguished, depending on whether or not A, = Ag. 


Cuse 1. If by = Ba, We set 22 = 


“4. Cz,. Then 


J pu 


dz. aye Be bor . 
pares se erste CS Dae et Aas = CAs 4 Ban) 
dx dx dx 


= Dae: — Anta Tae Chass ae CAAZ -1- 2(X) 
= Avkn -+- [b. ts COy — Res) 23 ~ go(X), 


where go(x) is some linear combination of g,(x) and /,(x). Let C be such 
that 
Ds -1- CA, ae ha) —_ Q, 


This is possible, since 2, = 2% by hypothesis. Then we obtain 


.. bs has now been eliminated from the second equation. 
ee let zy = yy ++ Cyz,. Then 


Hey Ve > ps tbB , . 7 
Se Gg ey ys eC yay Sal) 
dx dx dx 


— (OD. —. Cy; -}- Cala mars Gy a +- Ayan ~~ hola ~ 24(X), 


i 


where g(x) is some linear combination of g,(x) and f(x). Let C, be 
such that 


which is again possible, since 2, #2 A,. AS a result, we obtain 


and in the same way, we climinate the constants 5, from the rest of the 
equations in the first group, 


Case 2. If A, = An, we set 


cae 


nytl “dtp Da 4aey Faia, oe =, 


"Hy ay? 

where an is any nonzero constant. The second of these equaulons can 

be solved for Jn, and z, , since a O and 4, , #0. Then the 
e I 


¥ 
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(4, < st and ath equations can be written in the form 


eee away Sect, ve ae ae y) we ote f ( al 
~ Fiymay ede s Caan tad | x), 
Lah 
doa: ee ] dz, ‘ Fp d\* AS 
= a rae ae 
F apt . eet * 
dx tie HN ie. aN 
b, 1" T y Dy, elt ig 
~~ . * yy 72 pe ree ri 
Any Fny 
; Fy p-ybn 2 Be Shs 1 - 
x ‘ ae Vi il 7 - ce fay} ity en e5(%) 
Any Dns 
Bie fy Bie tid iy ; Fn ttn - 
—— oe ot ; ae ; ity] 
* ‘ * os J 4 I 
Sony Sy, Tiny 
me 
ee Finny Dinsat ty 2 
tT lea i ve g,,,(X) 
Zz " 
"Ay 
- Bi 240 a: - Fy 140-2 * ee 
oe ss =a ae Ke J ay-k Aan, ‘ g,,(*) 
: om. 
thy "Hy 
[f we write 
* 6 - iO a, _ Fy 14,3 
Ay, peimny- bl a ae See ay May-b 
Fin, Fits 


Where x, Is any nonzero constant. the mth equation becomes 


ae : - | 

ae = Futana T Agena, 7 g,,,(%), 
and the remaining equations in the first group can be treated similarly. 
In this way, we get rid of 4, eieluee .5;, 63. We cannot eliminate 
6. (unless bs = 0 from the SitseD), but ae 2, = 2. this is not necessary 
for reduction to canonical form. However, if 6, = 0, the substitution 
zy = Cz can be used to make 4, arbitrarily small. We remind the reader 
that ¢,v) always denotes a linear combination of g,(x) and f(x). 

Now suppose that in addition to Ag, there is another 4, say 43. which 
equals 2,. Then Ob, ais Dig eee ie veniget Can be eliminated by the same 
method as before. To avoid introducing new notation, we shall assume 
that 


already holds in (6.7"). However, 6, and Dig .2 may be nonzero. If 


by = 0, We can Interchange the groups of equations —— to 
42 and 2. and then it will be impossible to eliminate 4,,, .gin the reduction 


134 LINEAR SYSTEMS! THE CASE OF CONSTANT COEFFICIENTS CHAP, 6 


of the system to canonical form (unless Dy an = 0 from the outset), 
On the other hand, if both 6, 4 0 and 4, .. = 0, assuming that a, > a, 
(which can always be achieved by interchanging the groups of equations 
corresponding to A, and Ag), We set Z,, 29 = Va ao + Ciys. This gives 


3 {,,% 

dz nyt dy ayto | dvs 
a = = i 
dx dx dx 


~ 2 a i ms : 5 
= by sat) + AgVnyee t Cybaty + Cydayd + En 42%) 
= Dy ot, Asz aye — AgCiyr ++ Cybez, -+ Cyagy? + 2 eel). 


Since 5, 32 0 by hypothesis, we can always choose C, such that C,b, = 
—b,, 49, obtaining 
aera a 
———— = Ag na 1 Sn galX), 
dx 
since A, = 43. Moreover, replacing Vii. by z,,.2 — Ciy¥ in the next 
equation, we find that 
fy 
ese = —§,Cirs + 6,2, ee tate AgVn es aa Fives) 
ax 
and the term in yy can be eliminated from this equation by the sub- 
SULUUON =, 3 = Yas + Coyz. Making repeated transformations of 
this kind, we eventually reduce the system to canonical form. 

Finally we note that all the linear transformations made in reducing 
the system to canonical form are uniquely invertible, i.e., the linear 
relations connecting the new variables with the old variables are such 
that the old variables can be uniquely determined from a knowledge of 
the new variabies, Therefore the transformation carrying the yp, into the 
z; is itself linear and invertible, and hence nonsingular. The proof of the 
theorem is now complete. 


Kemark, From a practical point of view, the method just described for 


reducing the system of differential equations (6.7) to canonical form is very 
formidable. Therefore it is desirable to find methods which more rapidly 
reveal the structure of the canonical! system (1.e., the numbers A, and the 
number of equations corresponding to each 4,). Such methods will be pre- 


sented in the next two sections. 


43. Invariants of the Characteristic Matrix 


THEOREM, Suppose the nonsingular linear transformation 


Nl 
2, = D> kis p i. Cree | 
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carries the system 


ve. ae 
es = 249» ae IRE | (6.10) 
into 
dz, de | 
See) OZ, ea § Peerare | 8 (6.11) 
a SS; 
Then 
AE — B= KOE — A)KA, (6.12) 
wiiere 
ia;;l| = A, 5;;| = 8, WA,;] = K 


and E is the unit matrix of order n. In particular, 
det (AE — B) = det (XE — A), 


ie., the determinant of the d-matrix 2~E — A is invariant under the 
transformation.’ Moreover, the greatest common divisor (g.c.d.) of all 
minors of order 1(1 = 1,..., 1") of the characteristic matrix of the system 
is invariant under the transformation, in the sense that the g.c.d. of all 
minors of order | of #E — B is the same as the g.c.d. of all minors of order 
lof xE — A, to within a constant factor” 


Proof. The first assertion is almost obvious: Replacing every 2; 
in (6.11) by its expression in terms of 4),....,, We obtain 


Ao dy i ise 
D kis = => Oy D Kya) 
J=1 dx bee | saz] 


and hence, after substituting from (6.10), 


ni ? tt % 
2 Ki; > 55s a > Kade oe > b; jk 55)» 
j=1 s=] j,3e1 jysel 


Since this formula must be an identity in the y,, we have 


R 


> kij@js = D Oiykjs 
j=l j=l 


for every i and s, and hence, in matrix notation, 
VA —_ BK or B — KAKA, 


Which is equivalent to (6.12), since E = KEK~*. The invariance of the 
determinant follows from (6.12) and the familiar fact that det Af = 
det Af det N (where M and N are arbitrary n % 7 matrices). 

' By a Qemmazrix is meant a matrix whose elements are polynomials in 2. 

* In calculating the g.c.d., think of the minors as polynomials. 
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The proof of the invariance of the g.c.d. is a bit harder, Let a,(a) be 
the g.c.d. of all minors of order / of AZ — A, and let b,(A) be the p.e.d. 
of all minors of order / of AE — B. Every minor of order / of the product 
K(AE — A) is a sum of products of minors of order / of K with minors 
of order / of AE — A? Therefore every common divisor of all the 
minors of order /of AE — A divides every minor of order / of K(AE -- A). 
Similarly, every common divisor of all the minors of order / of AE — A 


hence divides b,(A). But then a@,(A) divides 6A). Reversing the roles of 
A and B, we see that 5,(A) divides a,(A). It follows that a,(A) and 5,(A) 
agree to within a constant factor, and the theorem is proved. 


Problem, Prove the Hamilton-Cayley theorem: Every matrix satisfies its own 
characteristic equation, i.e., AZ — A reduces to the zero matrix if ) is replaced 
by the matrix A. 


44. Elementary Divisors 


LEMMA.’ Suppose a square matrix P of order n contains a rectangular 
submatrix Q of height a and width b (i.e., with a rows and b columns) whose 
elements are all zero. Then det P = Oifa+b>n. 


pp eet ee Proof. We can always move Q 
into the upper left-hand corner of P 


without changing the absolute value 
of det P. Then P has the structure 
shown schematically in Figure 25. By 
Laplace’s theorem,® the determinant 
of P is a sum of products of minors 
of P of order b made up of elements 
in the rectangular matrices Q and 
Q, with corresponding signed minors 
made up of elements in the rectangu- 
lar matrices R and R,. But since 6 > 
n — a, each of these minors of order 
b contains at least one row consisting entirely of zeros. Therefore 
det P = 0, as asserted. 


FIGURE 25 


’ For the explicit formula, see e.g., V. I. Smirnov, Linear Algebra and Group Theory 
(translated by R. A. Silverman), McGraw-Hill Book Co., New York (1961), p. 30. 

4 Suggested by S. L. Sobolev. 

* See e.g., V. I. Smirnov, op. cit., p. 17. 
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Now consider the matrix 


M = 


where 


M = 


- 


fos hy 


} 


Ley 
io 


The matrix M, (s = 1,....4) is of order ,, and the numbers ¢,;, .,..., 
E,,-1 are all nonzero. Some of the numbers 2, 22, .... 2, May coincide, and 
all elements except the ¢,; and % — 4, are zero. The matrix M will be recog- 
nized as the characteristic matrix associated with the canonical form (6.6) 
of the system of differential equations (6.7), except for a slight change in the 
notation used for the off-diagonal elements. 
Obviously the determinant of Af equals the product of the determinants 
ol the submatrices A/,, My... .. Af,, and hence 
det M aie det N/, det M, re ea det MM, aa (% mazes Dy yh —— do)? eas (A vs a er 
Therefore the results of the preceding two sections imply 
THEOREM 1. The nunibers 34, has... hy, appearing in the equations 
(6.6) are the reots of the characteristic equation of the system (6.7). 
Next we find the g.c.d. of all minors of order / of the A-matrix AM: 
THEOREM 2. Let D,(d) be the g.e.d. of all minors af order ie te 
lo... nm) of the matrix M. and let m, denote the sum of the orders of all 


the submatrices. M, with diagonal element > — 2°. Moreover, let m, 


’ bon Oe . lich Oe, . ft: . +e 4 oA a SS 
be the largest order of all submatrices with diagonal elements A— 1, 
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let m be the second largest order of all such submatrices, and so on. 
Then, to within a constant factor, 
m : int) 


D2) = TT @.— 20mm 
ieee oo . "i ; 


ts] 


where the distinct values of dy, ba, «+, Ay are denoted by 
BO is a RO (i << k). 


Proof. The fact that the expression for D,(A) is valid for / = has 
just been proved. Thus let / = — I. Since a common divisor of all 
minors of order / of the matrix 4/7 must also be a divisor of det /, we 
have 

DAA) = (i — AMG — YP — AMY, 


where the p; are appropriate nonnegative integers (p, < mi,). Suppose 
that from the matrix AZ we delete a row and column which intersect 
outside all the submatrices A/,, as shown 
schematically in Figure 26. Then the re- 
maining matrix Af‘ of order n — | contains 
a rectangular submatrix Q, the sum of 
whose height and width equals 2 (@ 1s the 
shaded area in the figure, for the case where 
the deleted row intersects A/,). Therefore 
det." -- 0, by the lemma, and hence to 
find the g.c.d. D,_,(7) of all minors of order 
FIGURE 26 n— | of the matrix MM, we need only con- 
sider matrices obtained from 7 by deleting 
rows and columns intersecting inside one of the matrices M,, say A... 
Clearly the determinant of such a matrix Af’ equals the determinant of 
the matrix M/, obtained from M@, by deleting one row and one column, 
muiuplied by the determinants of all the other matrices A7,. Moreover, 
in finding the g.c.d. of ail minors of order » — | of Af, the minors of 
key importance are those containing the smallest powers of the factors 
AW. 2, But the smallest power of % — 4, in the deter- 
minant of M/ is obtained by deleting the first row and the last column, 
since then the determinant is just the constant 


= = & 
“s,lmsgt “4 7s), =) 
1 


which is nonzero, since all the €,,, are nonzero. In this case, A — As, 
appears in det M° toa power an, less than in det M. Therefore the smallest 
power of 4 — 7!" which can appear in all the minors of order a — | of 
the matrix M equals 


(2 seek 3! i ome 


3 
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and hence 
TH 


Dyualh) = TI — 2Pyrcm” 
int 


This completes the proof for / = mm — 1. Similarly 


m 
{21 


TT 7 tt} 
D, ar = [| (7. oo yes “In, 
int : 


and more generally we obtain the formula given in the statement of the 
theorem. 


DEFINITION. The factors 
(2, — layne” 
are called elementary divisors. 


According to Sec. 43, the g.c.d. of all minors of order / of the characteristic 
matrix 7E — A corresponding to the system of differential equations (6.7) 
is the same (to within a constant factor) as the g.c.d. of all minors of the 
characteristic matrix if corresponding to the system after reduction to 
canonical form. Therefore AZ — 4 and Mf have the same elementary divisors. 
In particular, if #2 — A has several identical elementary divisors, so does VW. 
Thus the structure of the canonical form of the matrix 2£ — A can be in- 
ferred from a knowledge of its elementary divisors and their multiplicities 
(the number of times each occurs). Only the off-diagonal terms in the matrices 
M, remain undetermined, and as we saw in Sec, 42, they can be given arbi- 
trary nonzero values. 


45. Determination of a Fundamental Set of Solutions of a 
Homogeneous System 


LEMMA. The m vector functions y(x),... VMN) sare linearly 
independent if and only if the vector functions 2 (x) = AVX), ..-. 
20x) <= Ky) obtained PODER) cae VON) Oy applying a 
nonsingular linear transformation K are also linearly independent. 


Proof, Suppose there are constants Cy,.... C,, not all zero such 
that 


Then 
nt | rab a 
S Cky(a) = KS CuO) = 0. 


? fesl 
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and hence premultiplying by K~? we obtain 


= C,y'"(x) = 0 


i=1 
which contradicts the assumed linear independence of the functions 
yx). The converse is proved by interchanging the roles of y‘(x) 
and z!(x) and those of K and K-+, 


Now consider the homogeneous system 


? 


or in scalar form 


oS Ai iV jr = 1,...,%. (6.13) 


j=1 
Then, as we have seen above, the elementary divisor (A — ,)”s of the matrix 


AE — A corresponds to the following group of homogeneous equations in 
the canonical system (6.6), involving nonzero constants €,, &,...; Ep 4: 


AZpiy " 
== AZ, 
dx 
dz 
k+o | 
a BZ yy H+ AsZe 2 
dx 
dz 
k+3 
= E0Z po TP AsZp sas 
dx (6.14) 
dz 
k+Ds—1 
7 = En 2S et g—2 a K s2pms-1 
dx. 
AZps ng 
eS A ee tes AZ iidg* 
dx 


Suppose we introduce new unknown functions by writing 


Zi = Zire . 
where in general A, is complex, with real part 4, and imaginary part 4%." 
® Thus e4s® means 
est(cos xx + isin Xx) 
(Euler’s formula), and 
d ~ ? rf é . * 
op erst =z 20%s*(cos Nx + isin Mx) + ers*(—2 sin Nx -+- IF cos 4X) 


e: (N+ 1X) et (cos Xx + isin Mx) == Aer, 
just as in the case of real > 
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After making this substitution, we find that 


i 
Ce eat 
et 


‘ is 0, 
ax 
ee os * 
= ~1 he dy 
dx 
ao 
TT ae Bs ay 
dx 
Cito) os 
+ py 2k > pyod? 
dx 
dors, * 
: = Zp —12 hep ds 
dx 


The solution of these equations is 
-*§ of cl 
Spey = Cy = Cy’. 
Substituting 2/°., into the second equation and integrating, we obtain 


* f pl ‘ 2 
ce ON aoe Ore Oat ee Oa 


Then substitution of z*,, into the third equation gives 


Cie Sa. s 3 
Sieg ET A Catgx + Cs = CE x? + CPx + CG", 
' 2&2 
and so on. up to 
3 ’ ps : per) Meo ‘ : F: * i eh 


Here the C’s with various indices always denote real or complex constants. 
Finally, transforming back to the variables 2.4, Zea e+ ++ Zee; x. We have 


Cd eet Ast (1) 
SEag =e Cy 


Pies ef / Crs as eat 


eee OP CGN ee Coys ih (6,15) 


~ a (3 ‘ “Lt ép,) a a -_ ~(p,) 
poe (Ck eG ee Ca) 


The equations (6.15) ) give the general solution of the system of differential 
equations (6.14) and moreover, according to Theorem ; Se ¢. 33 the system 
has p, linearly independent solutions of the form (6.15). We can satisfy the 
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entire eee canonical system by setting all the z, equal to zero 
exer those appearing in (6.14) and (6.15). Since the y,; are lmear combina- 

tions of z,()=1,..., 4), it follows from the lemma on p. 139 that the 
system (6.11) has p, linearly independent solutions of the form 


corresponding to each elementary divisor (2 — 4,)?* of the matrix AE — A, 
It is easy to see that we can assume that 


ony . 1 ree, 
Gr =0. tor 70 bur DIG SO 


i=] 
in the first of these solutions, corresponding to z,,. =¢°° = Zpsp, = 9, that 
i ( nr . 
iy * ir 
C,"=0; for gek “but De "|S 0 
i=] 


in the second of these solutions, corresponding to 2.5 = °° * = 22, = 0, 
and so on. 

If the matrix 7»£ — A has several elementary divisors, equal to various 
powers of 2 — 2"), say 


(7, a) ep (7, =e ed ae ea ta ee as 
then the system (6.11) has yu, -- +--+ yu, linearly independent solutions of 
the form 
wy Mel , 
Vee Ge i= 1 ni, 
3=0 


Where Af = max (u4,.... 4), Which can be found by substituting these 
expressions into (6.11), dividing by e*”’= and then comparing coefficients of 
identical powers of x.? These solutions are linearly independent of those 
found in the same way for the elementary divisors involving powers of other 
* — 2), because such elementary divisors are associated with other groups of 
equations in the canonical system. 


Remark. Suppose the system (6.11) has real coefficients a,,, and let y@) 
be a complex solution of (6.11). Then Re y(x) and Im y(x) are also solutions 
of (6.11). Moreover, given n linearly independent complex solutions yO), 
k= 1,...,nof(6.11), there must be x linearly independent functions among 
the 2 functions Re y(x), Im eG), koe ya (why ?). Therefore a 
homogeneous system with real coefficients always # 1as a fundamental set 
of real solutions. 


ti} 
*In other words, we use the method of undetermined coefficients to find the C;, ue 
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Problem I, Given a square matrix Al of order n, we define 
ef sz A A ae AP i me AM eee, 
: nt 


Prove that this series converges and that the matrix e! has the following 
properties: 
etiAgtad = gltitzl gf ae Fev A = (eA), 


Show that the matrix e*4 is differentiable with respect to x and (e74)’ = Ae=+. 
Prove that if A = K7'BK, then e+ = K~e?K, Find an expression for e74 in 
powers of a matrix with (ordinary) Jordan canonical form. Prove that if the real 
parts of the characteristic equation of the matrix A are less than some number 
—o« <0, then 

le Ay < Me7* 


for x > 0 (see Prob. 1, Sec. 32), where the constant .W depends on the choice 
of the matrix A. 


Problem 2. Prove that the general solution of the system (6.11) is of the form 
y — etl Cy 


where C is an arbitrary constant vector. Prove that the solution of (6.11) 
satisfying the initial condition y(x,) = y° is 


- = plt—Z)-1 vy" 


46. Implications for Homogeneous Equations of Order n 


We now consider the system (5.9), which is equivalent to the single 
equation (5.8), assuming that all the coefficients ap(x),...,. @,-,(x) are 
constants @,..., @,—,. Then the characteristic matrix of (5.9) is 


(6.16) 


r —| 


“Sg “=Ayp “Qe: dy "6 = —Gy_2 ae Gyo 


It is Casy to see that this matrix has the determinant 


M(X) aa A" — er is = a ee aX — ly. (6.17) 


[44 
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By deleting the first column and the last row of (6.16), we obtain a matrix 
whose determinant equals +1 or —1. Therefore, according to Theorem 2, 
p. 137, if the polynomial M()) has a root 2, of multiplicity p,, the matrix (6.16) 
has the elementary divisor () — 4,)": and no other elementary divisors involving 
powers of 2 — d,. 

It now follows that a root 2%, of multiplicity p, is associated with p, 
linearly independent solutions of the form® 


; 2 , ! oy t ei —1 va x f 
(Cys eC eta Cy le, (6.18) 
which can clearly be chosen to be 
ene xen, xretF, sete x slg Art 


In fact, if these functions were linearly dependent, then some expression of 
the form (6.18) with at least one nonzero coefficient C; would vanish identi- 
cally. But this is impossible since e’:* never vanishes and a polynomial cannot 
be identically zero unless all its coefficients vanish. 


Remark. Suppose the coefficients a; are all real, and let (A — 4,)?: be an 
elementary divisor of the matrix (6.16) where %, is complex. Then (6.16) 
also has the complex conjugaie elementary divisor (A — 4,)?*. Moreover, If 


y(x) = x*e* = x*e*(cos Box + isin 6.x) 
(2, = 4, + 2.) is a solution of equation (5.8) with constant coefficients, then 
SO 1S 

ty = xFehet x*e**(cos 6.x — isin §,x). 
Therefore the real functions 


¥(X) -F F(X) 


yee ee YS gh 
=x 6 COS. OLX, 


y(x) — FO) 


== x e% sin | GX 


both satisfy (5.8), and in this way we obtain x linearly independent real 
solutions of (5.8) [why ?]. 


Problem 1. Show that if the characteristic matrix of a system of w first-order 
linear equations with constant coefficients has the property in italics above, 
then there is a nonsingular linear transformation reducing the system to the 
form (5.9) equivalent to a single equation of order 1.9 


hese soluuions are also linearly independent of the solutions associated with other 
roots of M(z), since the latter solutions correspond to other groups of equations in the 
canonical form of the system (5.9). 
* In other words, we now have a necessary and sufficient condition for a system of a 
first-order linear equations with constant coefficients to be equivalent (in the indicated 
sense} to a single jinear equation of order ” with constant coelticients. 
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Problem 2. Find the general solution of Euler's equation 


i dnoty dy 


pt EO AN 
” dx 


+ ayy —= 0, 


where all the a; arc constant. (Introduce a new independent variable ¢ by 
setting x = -te'.) Note that we can also handle the case where every power of 
x is replaced by the corresponding power of ax + b. 


Problem 3. Find all solutions of the equation 
y (x) = av(x) + by(e — x) 
which exist for —oo <x < « (a, b and c are constants). 


Problem 4. Derive all the basic properties of the functions sin x and cos x 
by regarding them as the solutions of the equation »” + y = 0 satisfying the 
initial conditions }1 29 ° 0, ¥|ro9 = Land yl.9 = 1, y’ es = 0, respectively. 


Problem 3, After writing 


d'f laud & df 


Lif] — det —~ Gy dx? Se ae aa de = dy f, 


deduce the formula 


a i i 


; MMO) dy 
Less] = 0 Meo f+ MAS Sane rear 


li! dx nl dxty” 


Use this result to obtain the general solution of equation (5.8) with constant 
coefficients. 


47. Determination of a Particular Solution of a 
Nonhomogeneous System 


In studying nonhomogeneous systems, we confine ourselves to the case 
Where the functions /; appearing in (6.7) are of the form 
mn 


/ . ws Seer fey, 
ii: Coe 
kel 


with real or complex «,, C%* and nonnegative integral 8. Obviously, it is 
sufficient to Study the case wr = 1, since the particular s solution in the general 
case will be a sum of soiniien® corresponding to this special case. Thus we set 


fia Ce. 


and then write the group of equations in the system (6.6) corresponding to a 


146 


given 


dz 


LINEAR SYSTEMS! THE CASE OF CONSTANT COEFFICIENTS 


CHAP, 6 


elementary divisor (A — 4)” of the characteristic matrix AB — A: 


ket} _. Sas ot yy st 
jel ae AsZn. 1 + Criax’e 
dx 
dE p49 + - -{+ Cc x? git 
eeennieneeenneneentn omens yp L i As= pe ke € 
dx 
az. 4 ' *# er $4 
at : —? oe —o . +4 3 
dx 
= . . + a ? * * ba 5 : . . 
* 
-~ +! 
dz), as —_ bed ocd } e, -l ce xP pr 
_ = Syn, Loki py) 6 sehen, | AT Ds" 
dx 


Here the C* are certain new constants. Using the relations 


% Pars 
i 8 


to introduce new unknown functions =? 


i=k 1, + Dp, 


, we find that 


te" 
t 2 pos. oa = “~ 
kel __ Co Bol Asia : 
ax 
diner si ba oe i a ye git Ae 
wr ered Ie ? 
dx 
oF | 4 (6.19) 
—E~-G cd ee ——— eC uaxte 
ax 
Aes, P | oer 
¥? ee i eee ‘pi oP oe 
= Spy-in ksi] A Ge Po e . 
dx 


In Integrating this system, two Ca 
whether or not 4, equals he 


ses should be distinguished, depending on 


Case 1. Vf 4, 3 % we can integrate the equations (6.19) step by step, 


obtaining 


* 
ed ” 


a 


Scememad 
come 


where cach Af'(x) is a polynomial in 


follows that 


~ 
Pee 
: 


= MP(x)e™, 


™ According to complex variable theo 


for reul « — %, remain valid if « — 2, ts 


Mi x)e' aA 


ek a oe ee 


k t 
im oes 


210 | 
x of degree no higher than f. It 
jek - et FS are (F 


Bala’) 
y, the formulas obtained by integrating xre 


pears This can also be verified directly. 
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If none of the 2, equals «, then all the z, (i= 1,...,) are of the form 

Z, = MP(x)e™, 
and hence the particular solution y, is also of the form 

ya My le (6.20) 
The coefficients 4*(x) can be found by substituting (6.20) into (6.7), 


dividing the resulting equations by e*, and then comparing coefficients of 
identical powers of x. 


Case 2. lf A, = « the system (6.19) becomes 


dzt 
“hel * if 
dx Cit 
az, ‘ ee 
= ; = 512K 41 “PCy aX", 

ax 
dz . ge 
k+3 _* * Of 
a. = Eaexie + Cya3X" 
. * 

°F 
din. y, nan = x? 


= y,—-1" Jit py] + Crep x 
ax 


Integrating these equations step by step, we obtain a particular solution of the 
form 


Zee (X) = M!® (x)x*, Peli aaee Di 
where each M‘), is a polynomial in x of degree no higher than /. It follows 
that 
we) = MO he*, i= 1,045 Pe 
Therefore the system (6.7) has a saetieies solution of the form™ 


yx) = MES e*, i= 1. Ms 


where each M*'**?) is a polynomial in x of degree no higher than B+ p, 
and p is the highest degree of an elementary divisor of the form (A — a) of 
the characteristic matrix AE — A. 

Finally we consider the implications of the above fae for a single 
equation of degree n. Suppose « is a zero of order p > Oof the po lynomial 
(6.17) introduced on p. 143. Then the equation 

d"y aes y 


; dy dy da: 
j i = il ale ie “> al cisits sone ae - os 2 6 -- a l exten te ay! y+ hi XxX Sp (6. 2 1) 
ax" ‘ 


nad ax 
‘It would be incorrect to think that the system (6.7) must have a particular solution 
of the form x8Af*GY(x)eaz, since besides the given SfOup of equations corresponding to 
m= &, there are other groups corresponding 0 Ay 
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has a particular solution of the form 
v(x) = MET (x)e™, (6.22) 


where Af‘#=*)(x) is a polynomial of degree no higher than 6 ++ p. By subtract. 
ing an appropriate par ticular solution (6.15) of the homogeneous equation 
from (6.22), we can obtain a particular solution of the nonhomogeneous 
equation of the form 
Wola Mn oe 
Problem 1. Use the result of Prob. 5, Sec. 46 to find a particular solution of 
equation (6.18). 
Problem 2. Prove that the general solution of the system (6.1) is of the form 


ys ef fC + | 


a: 
whe 


gleoeelr (sys, 


where C is an arbitrary constant vector. Prove that the solution of (6.1) satis- 
fying the initial condition yxy) = y° is 


= git~to)A y? -- |“ seam a ES) as. 
- * Ty 


* 


Problem 3. Using the last result and Prob. 3, Sec. 19, prove that if every 
solution of the system (6.11) is bounded as v -- ©, then so is every solution of 
the system 
w= [4 + BX]y +O), 

where 

* nape 8 

2 'BOidx < &, I. Iflx) dx < 

6 

[the matrix B(x) and the vector f(x) are esnteuet Moreover, prove | that if 
every solution of the first system approaches zero as x -r # , then so does every 
solution of the second system. Find conditions under which analogous assertions 
hold for linear systems with variable coefficients.?* 


7 , ax+db 
48. Reduction of the Equation y’ = ae = 
to Canonical Form a 
We now consider the equation 
dy _ ax + b ve (6.23) 
dx ex-+d yo 


with real coefficients a, 6, ¢ and d. Equation (6.23) Is eq uivalent to the system 


dx ) dy (6.24) 


=ex-+dy, = = an + by 
dl di 


* For a number of results along these lines, see R. Bellman, op. ¢H. 
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involving the auxiliary variable ¢. Depending on the elementary divisors of the 
A-matrix 
A coma ot —<d 
(6.25) 


three cases can arise in reducing (6.24) to canonical form. 


Case 1. Suppose there are two real elementary divisors +, — 2, and — ds. 

In this case, according to Sec. 42, there exists a nonsingular linear transforma- 
tion 

x* = kyx + Kypy, yt = kayx + Koy (6.26) 


with real coefficients which reduces (6.24) to the form 


ot ys 
ibs a Se ay" ae a (6.27) 
dt dt 
If the determinant 
God 
(6.28) 
a ob 


is nonzero, then A, and A, are both nonzero, and equation (6.23) takes the 
form 


dy* = dop* 


after making the transformation (6.26). 


Case 2, Next suppose the matrix (6.25) has two complex elementary 
divisors k — dh, and } — 4g. Then A, and A, are complex conjugates, since the 
coefficients a, b, c and d are real, and the transformation (6.26) again reduces 
the system (6.24) to the form (6.27). The coefficients k,, and ky. are now com- 
plex conjugates of the coefficients £.,; and k., respectively. In fact, since 
Ag = A, (the overbar denotes the complex conjugate), it follows from the 
first of the equations (6.27) that 


and hence we can choose 


Introducing the notation 
Ay = a -f if (3 = 0), 
. : en op. 
Ky = Yh ~- is, kyo as is a 102, 


oe ee * ~ , a ES ’ 
SN + a) qa 8x bay 
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and taking real and imaginary parts in (6.27), we find that 


iad ; 
ads OF ao 
= 22 — By, = = BE + on, 


which implies 


{t should be noted that the linear transformation carrying x and y into © and 
v is nonsingular. since otherwise we would have 


Se Kyo 
== 0. 


Case 3. Finally let the matrix (6.20) have one elementary divisor (4 — 44). 
Then, according to Sec. 42, there exists a nonsingular linear transformation 
(6.26) with real coefficients which reduces (6.24) to the form 


Wee nw dy* ror 
aN, NT ee ny ye, (6.29) 
ai (lt 


where ¢ Is an arbitrary nonzero number. If the determinant (6.28) is nonzero, 
then 4; = 0. Moreover 4%, is real, since the coeifficients a, b, ¢ and dare real. 
Choosing = to be real, we find that the coefficients k;; are also real (recall the 
considerations of Sec. 42). For example, suppose = = 4,. Then (6.29) implies 


dy / an Panne / iv" a a oe 
hes FO ad 


Problem 1, Suppose equation (6.23) has a focus at the origin, Under what 
conditions (involving the coefficients a, 6, c and @) do the spirals wind in the 
counterclockwise direction as they approach the singular point? Under what 
conditions do they wind in the clockwise direction ? Solve the analogous problem 
for a node (of the type shown in Figure 15, p. 69). 


Problem 2, Suppose equation (6.23) has a saddle point or a node at the origin. 
Along what directions do the Integral curves approach the origin in the case. of 
a saddle point? Along what direction does the infinite family of integral curves 
approach the orivgin in the case of a node? If the origin is a center, locate the 
principal axes of the ellipses. Which axis is larger? 


49. Stability of Solutions 
Suppose the initial data are specified for x = xy. Then a solution 


J; ea v(x), Pe l, ss H (6.30) 
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of the system" 


dy; oe: | 
} == f(x, Vi ee aaa Vrs i= I, se # & i] (6.31) 

dx | 
is said to be stable (in the sense of Lyapunov) as x —+ + co if given any < > 0, 
there is an 7#(c) > O such that an arbitrary solution yx), f= 1,..., 2 of 
(6.31) satisfies the inequality 


Ips) — viG)| <e 


for all x = Ny, provided that 


% = 0 we = 
_ (Xo) — ¥i(Xo)| < aCe). 

If in addition 

7 * . 0 « , 

lim [¥,(x) — y,(x)] = 0 

ot og 

for sufficiently small |y,(%9) — ¥$(Xo)|, = 1,..., 7, we say that the solution 
(6.30) is asymptotically stable (in the sense of Lyapunov) as x ~ + x. Here, 
of course, it is assumed that the functions y?(x) are defined for all x > x, and 
that the system (6.31) is defined in some “neighborhood” of the curve 
Vee Oat ely-een gat OF thevorm 


Xo Xp [ve vitx)| <M, eS aes) 


Obviously, we need only consider the case 
VAX) = Q, j= | rans | 


since the 1',Cx) can always be replaced by new unknown functions 1(x) — yf(x). 
Finally, we assume that x and all the functions /;, 3; are real. 


LemMaA (Lyapunov’s lemma).") Suppose that for some =) > 0 the 
functions f(x, Vy... ¥,) are defined and continuous for Xy< xX < 2, 
fy SV, << ep P= t,... an, and satisfy the conditions 


JAM cies) = 0, BSS ge ars Me 


Suppose further that for the same values of vy; there exists a continuously 
differentiable “Lyapunov function” V(yy. 2... 3a) 2 O equal to zero only 
at the origin, such that 
zg) | 
S ——f, < 0. (6,32) 
Ans 3 
j=l Oy; 


MWe now return to the general system (4.4). 

"The pioneering work on stability of solutions of differential equations is due to A.M. 
Lyapunov, Probleme general de la stabilité du mouvement, Ann, Fac, Sci. Univ, Toulouse, 
9, 203 (1907), reprinted in Annals of Mathematics Studies, No. £7, Princeton University 
Press, Princeton, NJ. (19-47). 
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Then the null solution y(x) = 0, f= 1,...,4 of the system (6.31) is 
stable. Moreover, the null solution is asymptotically stable if 


e OV 2 | - 
a oY I; 0 C per oVa) S 0 (6.33) 
j=1 OV; 


for the same values of ¥y where Wh... +. 2a) > 0 is a@ continuous 
function equal to zero only at the origin. 


Proof. Let 0O<e< 9 and let K, denote the surface of the n- 
dimensional cube 


Moreover, let 


where clearly , > 0. Then choose 4 > 0, 4 < <so small that V < F, 
on A, and everywhere insice A,. Such a value of y exists, since V is 
continuous and vanishes at the origin. Since V is a composite function of 
x along any integral curve /, it follows from (6.31) that 

aVdy,; wmaV 


dx  jzidy;dx A ay; ~ 

along /. Thus the condition (6.32) means that V cannot increase along / 
as X increases: Suppose that as x increases, / starts at a point x = Xx) 
inside A, and subsequently reaches K, for the first time at some point 
xx). Thed 


V lowe < Me < Vly. 


rt Iq 


along /, contrary to the fact that V cannot increase along /. This contra- 
diction shows that no integral curve which begins at a point ¥ = %p 
inside A can ever reach A, as x—> +m, thereby proving the stability. 
If the stronger condition (6.33) is satisfied, then in addition to being 
a noninereasing | function of x along /, V approaches zero along / as 
X—- +o. In fact, suppose does not approach zero as x > +0. Then 
Vv exceeds some positive constant everywhere on /, i.e., V hes entirely. 
outside some sufficiently small cube K, It follows from (6.33) that 


Tee So ee ee 
along /, since W> %>0 (x == const) outside K,. Integrating tis 
inequality, we find that 


Mex) < V(xq) — 9(x — Xo), 


where the right -hand side a aac —% as x —» ++ 0, contrary to the 
definition of V. Therefore V 0 al ong / as x —- +00, But then every 
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integral curve / beginning at a point x, inside A, not only stays inside 
K,as X— --, but actually approaches the origin, thereby proving 
the asymptotic stability. 


Remark J, We emphasize again that the left-hand side of (6.32) or (6.33) 
is the derivative dV/dx of the function V evaluated along integral curves of 
the system (6.31). 


Remark 2. Lyapunov’s lemma has 
a simple geometric interpretation, as 
illustrated by the following special case: 
Let n= 2, and suppose the curves 
f= C(C = const) are closed and con- 
tain the origin. Moreover, suppose 
C < C’ (see Figure 27). Then the con- 
dition (6.32) means that no integral 
curve sharing a point with the curve 
¥ = C can leave the domain bounded 
by K=C. It follows that the null 
solution y, = 0, ). == 0 (the origin of FIGURE 27 
the xy-plane) Is stable. If the stronger 
condition (6.33) holds, then the integral curves cross the curve ¥ = C from 
the outside to the inside, since 


dv 


het THe sk @>O. 


Moreover, according to the lemma, V0 as x—» +c, Therefore every 
integral curve approaches the origin as x-> +o, ie., the null solution is 
asymptotically stable. 


THEOREM. Suppose the functions f, figuring in the system (6.31) 
satisfy the conditions of Lyapunov’s lenma and are of the form 


n 


TAX, Vi: a Va ae > aN; 5 PAX, Vi. eee i. 
jel 


where the a;, are constant, and the real parts of all roots >. of the character- 
istic equation 
~ LOD, 
det (AL =F A) ot 0 (6.34) 
are negative. Suppose further that 


EAN Mis eens idl < M([y,['** 5 ae va (6.35) 


for all x xX, and sufficiently small |y,|, where the functions F; are 
continuous (in all their arguments jointly) and M, & are positive constants, 
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Then the null solution 
v(x) = 0, H — ki se ny ai 


of (6.31) is asymptotically stable as X — + ©. 


Proof. It should be noted that the conditions of the theorem are 
satisfied if the functions f; are independent of x and have continuous. 
first and second derivatives im some neighborhood of the origin, and 
if the real oe of all roots of equation (6.34) are negative. In fact, we can 
then expand the /; in Taylor series, obtaining 


ri (2 | 
Ieee iS 2 sj - | S yi), 
j=l i 
Turning to the proof itself, we first consider the particularly simple 
equation 


—— = 2y— = —2lay 
ax dx 


and hence (6.33) and all the other conditions of Lyapunov’s lemma are 
satisfied by setting 
My) =v, Wy) = 2ay*. 


This suggests that we subject the system (6.31) to a nonsingular linear 
transformation reducing the “linear part’? of the system to canonical 
form, and then take the Lyapunov function to be the sum of the squares 
of the absolute values of the new unknown functions. Pursuing this 
idea in detail, we make a nonsingular linear transformation 


i a rae 
; j=s} 
reducing the system 
5 ae 
seeing eee LZ, ays ¥ 5: i= l, a ecg 
dx jz} 


to canonical form. confining ourselves to values of the variables 2; (in 
general, complex) for w hich the corresponding values of the y; are real. 
Next we write 


Vy, ve ey ie) a > |Z, = 2 Fite 


where, of course, every z, on the right is sailed as replaced by is 


** For the meaning of the symbol O, sce footnote 37, p. 67. 


SEC, 


49 LINEAR SYSTEMS! THE CASE OF CONSTANT COEFFICIENTS [55 


expression in terms of the y,. It follows that 


v 


d aa ae dz; ea dz, 
dx we dx 


To calculate these derivatives, we consider the group of canonical 
equations corresponding to some elementary divisor, say (A — 4)", 


of the matrix AE — A: 


dz ; 
pan i FU (x, saree F 


dx 
Ge a a 


“f 7 
as 


dx 


; - & 
NGL Spee oe es iE - 
—_ Ik“ ny—} arid MZ, am as ne a 215 i a ae ae 


dx 
(6.36) 


Here A, 1s one of the roots of equation (6.34), and §, is an arbitrary 
nonzero number. If the functions f; satisfy the condition (6.35), then the 
functions /\* satisfy a condition of the same form 


ao . |i Oo, catered 1+ 
[F; (x, 3 Z| < fs M*{|z | oe zal j? 
where A/* is a new constant. In fact, the F;* are face combinations of 
the F; with constant coefficients, and hence there are constants Af, and 


(6.37) 


M,., such that 
Zi 
PS Si cere ey IES Sie ea) 
al 


a Malye aes yal ds 


where every y; is a linear combination of the z,; with constant coefficients 


Therefore 
Wyte ac Afg(max [z,))P* Malley oF Deal] 


for every /, where Af, is a suitable constant and max [z,| denotes U 
largest of the numbers |z,|,.... [2,], and (6.37) follows at once. 
Using the equations (6.36), we find that 


dle lee sods; 
eget re 6.38 
dx dN dx (6.39) 
% Ae 
Fs L = cs iv a a aT GN, he 
— Ay=j4; -b Ay=,=; mt Riga “+ or ay-t4 i omy F; oj oe 


for i> 1. For ¢ = 1, we have the same equation without the terms in 
8, and %,. Writing 
Rea, =a; 
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CHAP, § 
and using the estimate (6.37), we deduce from (6.38) that 
d iz 2g =: Im te]? = 
—— = 2a, [zt + IBGE + Ise al] 
+ M*flz 8 +--+ + foal] max ef, (658 
i= Dig hy; 
rel ot . 2. aayn lez, 1 yo ylte ae mals 
——— «Jay [z,)° + 2M* fz.) 00+ +++ + fz.) 0°") max pais (6.39) 
adx 
But clearly 


Therefore, summing the inequalities (6.39) and " 39°) over all i from 
Il tow, and letting —a@ denote the largest real part of all the roots 3, 
(which is negative ‘ hypothesis), we find that 


Jy? tt 
ay 2 so ‘sie dea 
aD, [z,\° we —2a iG a BY oi 2M yee 


dx : eS) 


Where B is the largest of the numbers [8,|. Since the 8, can be made 
arbitrarily smail in absolute value, provided only that they are nonzero, 
we can choose them so that 


a 
B<-. 
4 


Moreover, let all the y,; and hence all the z, be so small in absolute 
Value that 


Then 
dV 


d x 


=< —aV, 


and setting W = aV, we see that (6.33) and the other conditions for 
applying Lya apunov’s lemma are satisfied. Therefore the null solution of 
(6.31) i is asymptoucaily stable as x —» +- co, as asserted. 


Remark. The stability property does not depend on the choice of Xp if the 
system (6.31) has a ee solution passing through each point of the curve 
ve YO), P= 1... ny X > xy. In other words, the solution }; = Jil), 
f= 1,...,0 of Hie ae (6.31) is stable for initial data specified at any 
fixed point x, > xy if and only if it is stable for initial data specified at % 
This is an immediate consequence of the fact that soutouS depend contin- 
uously on the inidal data in the finite interval Xo << X NG (see Sees. 19 and 
29, in particular the remark on p. 58). 
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Problem 1. The lemma proved in the text might be called the stability lemma. 
Prove the following fastability lemma, also due to Lyapunov: Suppose all the 
conditions of the stability lemma are satisfied, except that it is now required 
that V take positive values in every neighborhood of the origin and that 


a 
» rane b oe U(y,, sue »Vads 
j=1 OV; 
where U is a continuous function which is positive wherever V is positive. Then 
the null solution of the system (6.31) is wstable. 

Give examples showing that both lernmas fail if the function Vis allowed to 
depend on +. However, give further requirements guaranteeing that the lemmas 
remain valid. 


Problem 2. Using the instability lemma, prove that ifall the conditions of the 
theorem on p. 153 are satisfied, except that at least one root of equation (6.34) 
has a positive real part, then the null solution is unstable. 


Hint. Set 
ne a , 
Le 
VV cee ee), — > lz,i° ae > lz,!°, 
i=] =r) 


where z,,...,2, are the “canonical variables” corresponding to roots with 
positive real parts. 


Problem 3. Construct an example of a system (6.31) with only one stable 
solution, which nevertheless has a unique solution which satisfies arbitrary 
initial data and is bounded for all x. 


Problem 4, Show that even if the solution of the system (6.31) satisfying 
arbitrary initial Gata converges to the null solution y,(x) =0,f = 1,...,m as 
x -> +00, it is still not necessary that the null solution be stable (give an example). 
Suppose that in addition it is known that the null solution is stable. Then must 
all solutions with “sufficiently close” initial data also be stable? Analyze the 
cases n = | and > | separately. 


Problem 5. Show that if all solutions satisfying the condition 
Lidvo)] << M4 ah eee | (6.40) 


converge uniformly as x —» + © to the null solution, y; =: O,F = le... then 
all solutions satisfying (6.40) are stable. 
Problem 6, Prove that in the theorem on p. 153 and in Prob. 2, we can replace 
the condition (6.35) by the weaker condition 
FAX Ye Vn) = OE tt + ad. 
Gi 
FAX, Vues. Vn) 


Dil i Dal 


-- 0, — isaead! 


(uniformly in x) as all the 3; > 0. 
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Problem 7, Let n = 1, and suppose two solutions satisfying initial data 
¥(Xq) = VP and yxy) = y°* > y°) converge asymptotically to the same (finite) 
limit as x —» -- 2, Under these conditions, prove that if the initial data uniquely 
determine the solution, then every solution such that y®! < p(x) < y® ig 
stable. 


Problem 8. Find a necessary and sufficient condition for stability of the null 
solution of a system of homogeneous linear differential equations with constant 
coefficients. 


Problem 9. Prove that a necessary and sufficient condition for stability of 
the null solution of a system of homogeneous linear differential equations with 
continuous coefficients is that every solution of the system be bounded. 


Problem 10. Let 

aly’ - 
— = A(x)y (6.41) 
ax : 
be a linear system in matrix form with continuous periodic coefficients, ie, 
such that -l(x - @) = A(x) where a = const > 0. Let Y(x) be the fundamental 
matrix of the system (see Prob. 1, Sec. 33). Prove that ¥(x + a) = TY(x), where 
Tis some constant matrix. How is this matrix affected by changing the funda- 
mental matrix ? Find conditions for stability and asymptotic stability of solutions 
of (6.41), expressed in terms of the properties of the matrix 7. 


Problem II (M.A. Krasnoselski and S. G. Krein), Suppose the functions /; 
figuring in the system (6.31) are defined and continuous for all x, <x <4, 
Ao Vp 4,0 = 1,..., 4. Moreover, suppose there exists a continuously 
differentiable function V(1,...,4,,), defined for all y;, such that 


and @ continuous function @(17) > 0 [min /(y) < V < «&] such that 


[ “ al Vv + d ye “ H Va 
: = of naar c (p) . 
J0 ¢ D/ {7 ) - " an ey; i : ( ) 


Prove that any solution defined for x = Ngo Xy << Xy “<< & can be continued in 
the direction of increasing x onto the interval A A ee SS | 

Next suppose the functions /; figuring in the system (6.31) are defined and 
continuous for alla, cox -o x, eg te Pp ef Sd eta dhs ey 0 where all 
the f(x, 0,...,0) 0. State an analogous theorem giving conditions making i 
impossible for an integral curve beginning for x = xy (x, < Xp << Bata point 
other than the Origin of }-space to arrive at the origin for some finite value of x. 
(At the same time, the theorem gives sufficient conditions for the uniqueness of 
solutions beginning at points of the axis VMpoe tts yy, = 0.) Try to combine 
this theorem and the theorem in the preceding paragraph into a single theorem 
on the impossibility of an integral curve leaving some domain of espace fora 
finite value of x. 
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50. A Physical Example 


Consider a particle of mass mz: > 0 moving along the x-axis, subject to a 

resistive force 

ax 

= (a = const > 0) 

di 
proportional to its velocity (due to the surrounding medium, e.g., a liquid or 
a gas), a restoring force 

—OX (b = const > 0) 
applicd, say, by a spring obeying Hooke’s law,!® and an external periodic 
force which at time ¢ equals 
A COS Gof, 


where A and @ are real constants (@ > 0). Then the differential equation 
describing the motion of the particle takes the form 


dx dx 
ne + a + bx = ACOS wl. (6.42) 
dt” dt 
First we study the case where A = 0. This is the case of no restoring force. 
corresponding to “free oscillations’ of the particle. Assuming that the roots 
A, and A, of the characteristic equation 
mh - an b =0 (6.43) 
are distinct, i.e., that 


d°x dx oe 
m—+-a—t+b=0 (6.44) 
dt” dt 
Is given by the formula 
NOo== = C,e*" i Cre. (6.45) 
Ifa >> 0, the real parts of A, and %, are negative, and hence, by inspection, 


every solution of equation (6.44) converges to zero as f—> -+ 0, By the same 
token, every solution of the system 


dx : 
Sek ee. 
a (6.46) 
dx 
mpw—t es —bN — ax, 
dt 


'* According to Hooke’s law, the elastic force exerted by the spring acts in the direction 
of the equilibrium position of the particle, and is proportional to the displacement of the 
particle from its equilibrium position. 
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corresponding to (6.44), converges to the null solution x( = 0, x()-= Oas 
¢—» +-oo, This can also be interred from the theorem on p, 153. 7 

On the other hand, suppose a = 0. Then every real solution: of (6:44) is 
given by the formula 


eeinainaainasiss 
nd 


7 % . /, ‘ / a ane nae : 
x(t) = C, sin Vb/mt + C, cos ¥ b/m t = C sin (J b/m t+ 6) 


’ 
where 
i. i} . 
C= \! a ot as oF == CCOS 0, C. = & Sin 0. 


It follows that 


ety = = Cy bj cos (V bin t + Q). 
t 

Therefore the point (x(/). ¥,(4)) moves in an elliptical orbit in the XX,-plane, 
where the axes of the ellipse lie along the coordinate axes. The ratio of the 
semiminor axis to the semimajor axis is the same for every ellipse, i.e, ¥ bim, 
and the origin is clearly a center for the system (6.46). Moreover, the particle 
oscillates along the x-axis, with period 2x\/m/b which is the same for every 
solution of (6.44), 

Next we consider the motion for a > 0 in somewhat more detail, distin- 
guishing three cases: 


3 


Case 1. If a* > 45m, both roots of the characteristic equation (6.43) are 
real and negative. Clearly, neither the function x(1) given by (6.45) nor its 
derivative can vanish for more than one value of ¢. In particular, x(2) has 
no more than one maximum or minimum. In this case, the origin of the 
xX,-plane is a node of the system (6.44), as in Figures 12 and 13, p. 68. 


Case 2. If a* = 46m, the solution of (6.44) is given by 
Koz evstem(C, + Col). 


As before, neither of the functions x(r) and x,(/) can vanish for more than 
one value of 7. The origin of the xx,-plane is again a node of the system 
(6.44), of the type shown in Figure 15, p. 69. 


Case 3. If a? <4bm, the roots of the characteristic equation (6.43) are 
conjugate complex numbers, of the form 


Mist Soe Ae (u= 4 >0,8>0), 


and the real solutions of equation (6.44) are given by 


x= e "(Cy sin St 4- Cy cos 81) = Ce™™ sin (Bt + 9). 


The particle oscillates along the x-axis with exponentially decaying ampli 
tude Ce~**, and with period 2 8 which is the same for every solution of (6.44). 


SEC, $0 LINEAR SYSTEMS! THE CASE OF CONSTANT COEFFICIENTS  [6l 


Finally, we analyze the case where 4 =< 0 in equation (6.42), It is now more 
convenient to start from the equation 


“ 


as mz 
nm— +a—-+ be = Ae (6.47) 
‘ic dt 


instead of (6.42), noting thai the real part of every solution of (6.47) satisfies 
(6.42), while conversely every solution of (6.42) is the real part of some solu- 
tion of equation (6.47) [why ?]. 

If both roots of (6.43) are different from fw, the general solution of (6.47) 
is given by 
Ale’! 
mio) + a(iw) + b 


a> Cie ~+- Cue a 


if 2, $4 Ae, and by 


ital 


Ale 


2 = Cye™ + Cate + —“*___ 
m(iw)” -+ atin) + b 


if 2) = Ao = A. The first two terms in these formulas give the general solution 
of the homogeneous equation (6.44). This solution is bounded as t— + %, 
for arbitrary C, and C,. The second term is a particular solution of equation 
(6.47), found by the rule given at the end of Sec. 47. If a@ > 0, the first two 
terms in both formulas approach zero as ¢— +- 20, and hence the solution of 
(6.47) approaches 
ne” 

mi)” 4- a(iw) -- b ) 


(6.48) 


Thus the smaller the absolute value of mi(iw)? + a(im) -+ 6, the larger the 
absolute value of the function (6.48), for fixed A. 

If ni{iw)*? -+- a(iw) + 6 = 0, which can only happen if a = 0, the general 
soluuion of (6.47) becomes 

ou re | 
SE Oo a Ga (6.49) 
2in{ie) 

The first two terms of (6.49) give the general solution of the homogeneous 
equation (6.44), and remain bounded as f—» +oc. The second term is a 
particular solution of equation (6.41), found by the method of Sec. 47, and 
goes to infinity in absolute value as ¢—> +09. In other words, the solution of 
(6.47) is now an oscillatory function whose amplitude increases without limit. 
In physics this phenomenon is called resonance (in the present case, between 
the free oscillations of the particle and the external force). As just shown, 
resonance occurs when the period of the free oscillations of the particle coin- 
cides with the period of the external force. {t is important to anticipate the 
possibility of resonance occurring In an actual physical system, since the 
resulting “sympathetic oscillations” may well become large enough to 
eventually destroy the system. 
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Problem {. Analyze in detail the case where a < 0, corresponding to oscilla- 
tions with negative resistance. This occurs in a number of physical systems When 
energy is supplied to the system from the outside. | 


Problem 2. Prove that the null solution of the system (6.46) is stable; by 


studying the time derivative of the energy integral 


? 
~ 
2 


mfdx? bh . 
og ai + a" ° 


al 


# 


2 


/ 


AUTONOMOUS SYSTEMS 


51. General Concepts 


If the independent variable does not appear in the right-hand sides of 
the system (4.4), the system is said to be auronomous (or dynamical). In the 
theory of autonomous systems, the independent variable 1s customarily 
interpreted as the time, and the system is then written in the form 


1x - 64 
ee J Nisa a Ads = earn |e (7.1) 
at 
or more concisely as 
dx .. it 
— = /(N), (7.2) 
dt 
where 
Vy It 
Vi Te 
Wee . fH: 
vy, J 


However, the symbol x will also be used to denote the point with coordinates 
ee ee te. os 

For simplicity, we shall assume that the functions f; appearing in (7. ve 
defined for all x and satisfy a Lipschitz condition tn all their arguments in 
every bounded part of space. Then there is a unique solution x = x(f: x") ol 
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the system (7.1) [equivalently, of the system (7.2)] satisfying the initial condi- 
tion .x(0) = x° and defined in a neighborhood of the value ¢ = 0.! This solution 
can be interpreted as the motion of a variable point x in a space of n dimen- 
sions. Where x describes a trajectory {0 which depends on the choice of the 
inital point x*. Note that a trajectory Is not an integral curve, and in fact 
integral curves of (7.2) lie in the (2 +- 1)-dimensional space of points (x, 2), 
The velocity vector corresponding to the motion x = x(#) is given by 
ay 
bear 
dt 
Therefore the autonomous system specifies a velocity field in x-space, assign- 
ing a vector v = f(x) to every point x. A solution of (7.2) corresponds to a 
motion such that the variable point x has the given velocity v = f(x) at each 
point of its trajectory. The fact that r does not appear in the right-hand side of 
(7.2) means that the velocity field is stationary, i.e., does not vary in time. 
Thus the system (7.2) can be interpreted physically as the stationary flow ofa 
gas in x-space, with the solutions describing motions of the gas particles, 
As we know, one of two cases can occur when the solution x(t: x°) is con- 
tinued in the direction of increasing ¢ (similarly for decreasing /): Either 
the solution can be continued onto the whole strip 0 < 1 < ©, or else the 
point x(t; x°) “goes off to infinity’ for some finite ¢ = T (cf. Remark 3, p. 32) 
For simplicity, we shall assume that the first case always occurs. It is easy to 
see that this does not really lead to any loss of generality in studying the 
trajectories of the system (7.2). In fact consider the autonomous system 


dx, ae | 
en Sra Visiiedap Na) ol Nie ance), ‘eer (7.3) 


Where the function ¢ satisfies the same requirements as the /; and is nonvanish- 
ing. Then (7.3) has the same trajectories as (7.1), although the trajectories of 
(7.1) and (7.3) are not traversed with the same velocity. Moreover, for an 
appropriate choice of 9, the velocity of the motion given by (7.3) will be uni- 
formly bounded, so that the moving point cannot go off to infinity in finite 
ume. In fact, we need only choose 


ise 
Thus we shall henceforth assume that the solution of (7.4) equal to x° lor 
== Qis defined for all x, x° in x-space and for all r(—oo << 9). Let this 
solution be denoted by 
X= x(t; x), (7.4) 


ate 


i } * > 81g) ute . + Ps ee a * ws ‘ ; * = TL! ‘ 1 
Here we write x? instead of xy tO prevent any possible confusion with a component of 
the vector x, 
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where, figuratively speaking, x(f; x°) is the point into which x® moves in 
time t. The function (7.4) has the following properties: 


1) x(¢; x°) is continuous in ali its arguments; 
2) X05 x =X, 
3) x[ta; x(t,3 x°)] == x(t, ++ tg; X*). 


Property 1 follows from the theorem on continuous dependence of a solution 
on the initial data, and property 2 is an immediate consequence of the defini- 
tion of the solution x(t; x°). To prove property 3 (often called the group pro- 
perty), we first note that the absence of / in the right-hand side of (7.2) implies 
that if x = (fr) is a solution of (7.2), then so is x = @(f + ft) for arbitrary fy 
(verify this).” Therefore 


ee RS Ne) |: XS Pty x”) 


are both solutions of (7.2). But for t = 0, both expressions give the same 
point x(t,; x°), and hence 


x[ts x(t); x9] = a(t + 0,3 x°) 


by the uniqueness theorem for solutions of systems of differential equations. 
Setting ¢ =f, we obtain the required property 3. Intuitively, property 3 
means that the point into which x° moves in time ¢, + f, can be found in two 
steps: First find the point P into which x° moves in time ¢,, and then find the 
point into which P moves in time f,. These properties are so important that 
an autonomous system is often defined as a family of mappings (7.4) of an 
arbitrary set into itself satisfying properties 1-3, without reference to any 
underlying differential equations.* 
It follows from properties 2 and 3 that each of the two mappings 


= (yx), xX = x(—1; x°) 


of x-space into itself (where ¢ has any fixed value) is the inverse of the other. 
In fact, 

x[—1; x(2 x°)] = x(— 4; x) = x(0; x9) =", 
and similarly, 

x(t; x(—0; x) = x(t — 15 x°) = x(0; x9) = 
It should also be noted that if two trajectories have a common point, then 
they coincide and the corresponding solutions differ only by a constant time 
shift. In fact, if 

x(t,3 82) = x(1,3 4°), 


* The second solution corresponds to the same trajectory / as the first, except that the 
motion along / is shifted ahead by time fo. 


* It is assumed of course that continuity of the mappings on t 
defined. 


he given set can be suitably 
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then | 

NA te See] 
by the uniqueness theorem, since the two solutions coincide for 7 = l. 


Remark. Second-order systems of the form 


i oe dx, aX, ; a 
Ni, — =f x, es fF ey Ns ota y Fe © 6 caaraae % t = I, ee #4 N (7.5) 
adic ‘ dt dt: . 


are encountered in studying mechanical systems with 7 degrees of freedom. 
After introducing new variables 


dx; 
in,—— = p 
dt 
(called momenta), we can write (7.5) in the form 
dx, =p 
dt om, 
Laer p Pa 
ip; . . l “n 
—ee Sil Gece ante 2s * 
dt Ny My! 
Where i= 1,....2. This is an autonomous system in the 27-dimensional 
space of points (44,....24,. Pir. ++, p,,) called phase space. 


Problem, Suppose we are given a function (7.4) in the a-dimensional space 
of points x = (4;,...,.x,) which satisfies properties 1-3 and is continuously 
differentiable with respect to 2, Prove that this function is the solution of an 
auionomous system of the form (7.2) with a continuous right-hand side. 


52. Classification of Trajectories 


THEOREM. A solution X(t) of the autonomous system (7.2) must be 
of one of ihe following three types: 
1) Nonperiodic, which means that x(t,) 36 x(t) if ty Ft} 
2) Periodic. which means that Mt T) = x(t) for sone positive 
constant T (called a period) but x(1,) = x(t.) PO <n <7; 
3) Constant, i.e. x(t) =: xX%. 


Proof. Suppose the solution x(f) is not of type 1, i.¢., suppose there 
are mes fy and to (tf) = 1) such that x(f,) = xX(t.). Writing 7 = & — 4, 
we find that 


Mi =) ex), (7.6) 


since the solutions x(¢ + z) and x(z) agree for 1 = t,. Consider the set 
K of all numbers + sauisfying (7.6). It is easy to see that if 7 belongs to Kk, 
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so does —7 [replace ¢ by ¢ — 7 In (7.6)]. Moreover, if 7, and +, belong 
to K, so does 7, -}- 72, since 


MQO +P Ty ce Fy) = (PE 3,) = X(N), 


and hence so does 7, — zy (why ?).! 
There are now two possibilities: 


a) The set A contains a smallest positive number 7. Then 
x(t + T) = x(t), bul xU,) 4 x.) FO mt < T; ie., the 
solution is periodic and we have a solution of type 2.° 


b) There is no smallest positive number in AK. Then A contains 
arbitrarily small positive numbers (why?). Hence there ts a 
sequence of positive numbers +, belonging to A such that 

7, > O-+-- asu— co, But then 


hn 
f 
{— |—|7, > 0 as n—> &, 


cartel 
a * Ji 


where [y] denotes the integral part of y, and therefore 


, i \ 
xi) = x {1 _ ~}:,) —» x(Q) as HN BD, 
ne ee 
since the solution x(/) is continuous. In other words, x() = x(0), 
and we have a solution of type 3.6 The proof is now complete. 


DEFINITION, A Irajectory is said to be open if it corresponds to a 
nonperiodic solution and closed if it corresponds to a periodic solution. 
A closed trajectory is also called a cycle. A trajectory corresponding to a 
constant solution is called a critical point. 


53. Limiting Behavior of Trajectories: The General Case 


Let x(s) be any solution of the system (7.2), and let / be the corresponding 
trajectory. A point ¥ is said to be a finit point of the solution x(t) [or of the 
trajectory /] as ¢-» -+ 00 if there is a sequence of times f, > 4% such that 
X(t) — ¥. The set of all such points is called the limit set of X(t) asf er on. 
The concepts of a fimit point and a limit set as f—» — 99 are defined in the 
same way? 

“In alvebraic language, the set X is a group with respect ta addition. . 

* Lt is easy to see that in this case, K consists of all integral multiples of 7, 

"In this case, K contains the whole f-axis. | 

"Sometimes limit points as ¢—- — 9% and fr + % are called x-limit points and e- 
limit points, respectively, and similarly for limit sets. 
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For example, suppose that as 1— +- 0 the 
trajectory / “spirals toward”’ a cycle / (see Figure 
28). Then / is the limit set of / as tr—> +00, Tn 
fact, choosing any point ¥ €/ and points 


@y = X(t), 66-5 Ay = X(tn), ose, 


as shown in the figure, we find that 


a, = Xx(t,) > * as t,—* © 


Figure 28 


(why ?). A cycle like / which is the limit set as 
f— +c or as f—» —o of another (distinct) trajectory 1s called a /imit 
cycle (cf. Sec. 24). 

li is easy to see that a critical point is its own unique limit point both as 
t—» +-co and as r— — oo. Moreover, a closed trajectory is its own limit set 
both as ¢—» +00 and as t—» —oo. The limit sets of open trajectories are of 
greater interest, since they determine the behavior of trajectories for large |¢|. 
We now establish some simple properties of limit sets, thinking of limit sets 
as f—» -- co (to be explicit).§ 


THEOREM 1. A limit set 1 is closed, regarded as an n-dimensional 
point set, i.e., | contains all its (set-theoretic) limit points. 


Proof. Let be the limit set of a trajectory / with equation x = x(/), 
and consider a sequence X, €/, ¥,— ¥ as k — oo. By the definition of 
/, given any x, there is a sequence of times t,, such that x(t, ) > X, 


asn—» oo. Let &, be such that f, > & and 


o{x(7,), *} < : ’ 
kk 


where g(a, b) denotes the distance between two points a and b of x-space. 
Then clearly f,,-- oc as k — oo, and moreover 


(ry) <= : ~r ! - os | oy 
olx(tz). 4] < o[x(é,), 4] + o(,, ¥) < k el he hee 0 as k—-» 0. 


In other words, £ is a limit point of x(t) as 1 -+ &, as required. 


THEOREM 2. If ¥ belongs to a limit set 1, then | contains |, i¢., 
consists of whole “trajectories.” 

r} * + ; * bd pe yD) 

Proof. Suppose the original trajectory / has equation x = x(t; 3°); 
and let x(i,,; x°) -» € as n— ow (where ¢, — oo). Then 


XL, -F 65 x°) = x; KU) ext; x); 
ie x(t 8) €L 


Of course, limit sets as ¢—- — wo have Just the same properties, 
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THEOREM 3. A necessary and sufficient condition for a limit set to be 
empty is that 
fi 


x(t) as t+ +m, 


jest 
he, that the curve x(t) “go off to infinity” as t—» 4-2. 


Proof. If the condition holds, the limit set is obviously empty. 
Conversely, suppose the limit set is empty but the condition fails to 
hold. Then 


ni 
> xi(t) a R® 
ts} 

for some RK and sufficiently large ¢. Hence there is a sequence of times 
i,» CO (kK — co) such that the sequence of points x(¢,) is bounded. But 
a bounded sequence x(f;) contains a convergent subsequence, and the 
limit of this subsequence must be a limit point of x(2) as f—> +o. 
This contradicts the assertion that the limit set is empty, thereby 
completing the proof. 


THEOREM 4, A necessary and sufficient condition for a limit set to 


consist of a single point % is that x(t)—> X as t— +0, Le., that the 
trajectory “enter the point X° as t—> +. 


Proof. If the condition holds, the limit point is obviously unique. 
Conversely, suppose the limit point is unique but the condition fails 
to hold. Then, given any ¢ > 0, there is a sequence 4, — 4-00 (A —» 2) 
such that of[x(z,), #] > ¢ for all ¢,. But by the definition of *%, there ts 
another sequence {7 -> +o such that e[.v(1,), ¥] << for all sufficiently 
large ¢{. Therefore, by the continuity of x(s), there must be a third 
sequence “ such that p[x(r"), ¥] = ¢ for all sufficiently large 7. This 
bounded sequence &’ contains a subsequence converging to a point ¥ 
which is a limit point of x(¢) as ¢—- +o. But obviously a(%, X) = ¢, 
and hence there is at least one other limit point besides +. This contra- 
dicts the assertion that ¥ is the unique limit point, thereby completing 
the proof. 


Problem, Prove that if a limit set / is nonempty and bounded, then it is 
connected, i.e., it cannot be represented as the union of two nonempty disjoint 
closed sets. (In particular, it follows that if / consists of more than one point, 
then it contains infinitely many points.) Show that if / is unbounded, then it 
may not be connected (give an example), However, show that / becomes con- 
nected if the “point at infinity in x-space” (i.e. the point “approached by every 
sequence of points x going olf to infinity”) is adjoined to f; 
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54. Limiting Behavior of Trajectories: 
The Two-Dimensional Case 


If n = 2, the system (7.2) reduces to 


ae = fi(Xy, Xe). oe == fa(X1. Xa). (7.7) 
di! at 
Then. as shown by Bendixson, much more ih be said about the limiting 
behavior of trajectories than in the case 1 > 2. We now give some of Ben- 
dixson’s results, which are based on the fact that a closed curve in the plane 
(with no self-interseciions) divides the plane into two parts.” 


THEOREM 1. /f-@ trajectory | contains at least one of its limit points 
¥ as t—»-- 9 or as t—» —H, then | is either a closed trajectory or a 
critical point, 


Proof. i be ae Hicit, let —» + oo, If ¥ is a critical point, the whole 
trajectory reduces to the single point ¥ (why ?), and the theorem is proved. 
Thus suppose ¥ is not a critical point, and 
consider a small neighborhood U of &, 
bounded by two segments of normals to a 
smail are of the trajectory /; containing ¥ and 
by iwo arcs of neighboring trajectories (see 
Figure 29). The trajectories inside U form.a 
family of slightly curved parallel arcs, along 
which the motion proceeds with almost con- 
stant velocity. By hypothesis, the trajectory / 
under discussion passes through X. Since ¥ Is 
a limit point of /, the trajectory / must inter- 
sect U aguin as ¢is increased. We now show that / passes through Xx again 
as fis increased, i.e.. that / is 2 closed trajectory, In fact, suppose l does 
nol pass through ¥ again. Then the arc of / together with the segment 
ca formsa closed curve (see Figure 29), bounding a finite region S of the 
plane, which we call a “Bendixson pocket,” Suppose the trajectories of 
the system crossing ca enter S from the outside, as in Figure 29. Then / 
is doomed to stay inside S forever afler, Since it can neither intersect the 
arc abe (an open trajectory has no self-intersections) nor the segment ca 
(all trajectories crossing ca must enter S from the outside). But then after 
entering the pocket ait ¢, the trajectory / can never come arbitrarily 
close to X, contradictiny the fact that Tis a limit point. Similarly, if the 
trajectories crossing ca leave S from the inside, as in Figure 30, the 


Si 


FIGURE 29 


4s ree Mer 7” Z * _s - 5 7 . 
Phis is not true in spaces of hivher dimension, 
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trajectory / can never return to the pocket S after leaving it at c. and 
hence can never come arbitrarily close to ¥, again contradicting 
the fact that ¥ is a limit point of /. This com- 
pletes the proof, 

LumMaA. /f a trajectory | has a limit point as 
t—» -+-00 belonging to a closed trajectory 1, then 
| either coincides with | or else spirals toward 1. 


Proof. According to Theorem 2, Sec, 53, PiGine 30 
the limit set of / contains the whole trajectory 
I, Consider a small neighborhood U of any point * €/ (see Figure 31). 
By the definition of a limit point, the trajectory / must intersect U for 
arbitrarily large 7, and hence, since the trajectories inside UV are “almost 
parallel,” / must intersect the normal <” to / (drawn through .) at some 
point inside U. If the point of intersection is the point ¥ itself, then / = i 
Otherwise, let the point of intersection be a 
point a. which, for definiteness, is assumed to 
lie outside /, If @ is close enough to ¥, then 
the theorem on continuous dependence of a 
solution on the initial data implies that / will 
again fall in Uafter a time T equal to the period 
of traversing the cycle / once. Let ¢ denote the 
first intersection of / with Xn after a. Then the 
whole are ac of / lies in an arbitrarily narrow 
strip along the cycle /, provided that a is close 
Figure 31 enough to ¥ (why?). It is easy to see that 
a(c. ¥) << ofa, ¥). In fact. if ¢ =a, then / Is 
a cycle and henee never approaches ¥, while if / traverses the dashed 
path in Figure 31, it can never enter the “pocket” abda (why not?) and 
once again cannot approach *. Continuing the trajectory / after ¢, we 
obtain another “loop” around / and another point of intersection of / 
with 2, and so on. Since * is a limit point of / these points of inter- 
section must approach .¥, and hence, because of the theorem on con- 
tinuous dependence of a solution on the initial data, the consecutive 
loops of / must converge uniformly to /, This proves the lemma. 


THeorem 2. Lee G be a closed bounded domain containing NO 
critical points of the svstem (7.2), and suppose every trajectory beginning 
inside G att = 0 stays inside G for all t >> QO. Then every such trajectory 
is either a cyele, or else spirals toward a cycle ast 8. In particular, 
G must contain at least one cvele. 


Proof. Consider any positive half-trajectory / beginning inside G, 
f 


Le, a curve vez x(t¢ x") where x"EG and 0 1 = &. Since / Is 
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coniained in G, by hypothesis, it has at least one limit point £¢G as 
f— +o, by Theorem 3 of Sec. 53. If Ye/, then / is a closed. trajec- 
tory, by Theorem | of this section. Otherwise, consider the positive 
half-trajectory / beginning at ¥, and let ¥ be a limit point of ! ast —» -- 0, 
We need only show that ¥€/ since then 
Theorem | implies that/ is a closed trajectory, 
and hence / spirals toward / by the lemma. If 
¥ does not belong to /, consider a small neigh- 
borhood cf ¥ (see Figure 32). Let @ and c be 
two points of intersection of / with the normal 
drawn through ¥ to /:, and suppose the trajec- 
tories crossing the segment ac of the boundary 
abca or the “packet’’ S enter S from the out- 
Ficure 32 side, as in Figure 32. By Theorem 2 of Sec. 53, 
every point of / is a limit point of /. Therefore 
/ eventually comes arbitrarily close to ¢ and hence enters the pocket S. 
But then / must stay inside S forever after, and hence cannot approach 
the point a €/ (see Figure 32). despite the fact that a is a limit point 
of / as f—» +oo. The case where the trajectories crossing ac leave S 
from the inside is treated similarly. This contradiction completes the 
proof. 


THEOREM 3. Given a closed trajectory |, suppose there are no other 
closed trajectories in some neighborhood of |. Then every trajectory 
beginning sufficiently close to | spirals toward las t — +-2 or ast—> —@. 
Moreover, it is impossible for one trajectory outside | to spiral toward | 
as i —» —-% while another spirals toward | as t —- — wo, and similarly for 
irajectories inside 1. 


Proof. Since there are no critical 
points on /. there is a narrow strip D 
containing /in which there are also no 
critical points (why?), and moreover 
no cycles other than /. Let an be a 
fixed, sufficiently small segment of the 
normal to / at some point *€/ (see 
Figure 33). All the trajectories beein- 
ning sufficiently near /come arbitrarily 
close to / when continued forwards and 
backwards in time, and hence intersect 
mn. Therefore we need only consider PIGURE Ss 
trajectories beginning on the segment 


mi Near X. Suppose we draw a trajectory through some point @ én. 
If b is the next point of intersection of this trajectory with av, then 


SEC. 54 AUTONOMOUS SYSTEMS 173 


ba (why?). Assuming that (4, %) < (a, X), and considering the 
“ring-shaped pocket’ bounded by the arc ah of the trajectory through 
a, the segment ba and the cycle /, we find that the next intersection of 
the trajectory with nm after 6 occurs at a point c on the segment xb, 
the following one occurs on the segment Xc, and so on. The resulting se- 
quence of points of intersection must converge to *, since otherwise there 
would be a cycle distinct from / passing through the limit point of the se- 
quence (think this through !). In other words, the given trajectory spirals 
toward /. The other trajectories beginning outside / (and sufficiently near 
/) are “squeezed between the loops’’ of the given trajectory, and hence 
also spiral toward / as tf - + co. On the other hand, if o(b, £) > ofa, %), 
we need only reverse the time direction (replacing t by —1) to reduce the 
problem to the case just considered, i.e., the trajectories now spiral toward 
fas f—» —oo. Since the part of the strip D lying inside / can be handled 
in the same way, the proof is now complete. 


Remark. Jt follows from Theorem 3 that an isolated cycle, i.e., a cycle 
which has a neighborhood containing no other cycles, must take one of the 
three forms shown in Figure 34, called stable, unstable, and semistable, 
respectively. 


FIGURE 34 


Problem I, Give an example showing that Theorem | is false for an autono- 
mous system in #-dimensional space if a> 3. 


Problem 2. Give an example showing that a stable limit cycle need not be 
Stable in the sense of Lyapunov. What extra conditions guarantee Lyapunov 
stability ? 


Problem 3. Suppose the right-hand sides of the autonomous system (7.7) 
are polynomials of degree no higher than two. First prove that no curve other 
than an integral curve can be tangent at more than two points to the direction 
field determined by (7.7). Using this fact, prove that 

a) Every closed trajectory of (7.7) is convex; 

b) The motion along two closed trajectories proceeds in opposite directions 

if one trajectory lies outside the other, and in the same direction if one 
lies inside the other. 
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Prove that there can be no more than one critical point inside a closed trajectory, 


Comment. It can be shown, but not too easily, that a critical point lying 
inside a closed trajectory is a focus.!” | 


55. The Succession Function 


A useful concept in studying cycles and certain other problems. is that 
of the succession function. In essence, this notion has already been used in 
Sec. 54. For simplicity, we confine our discussion to the case # = 2, corre- 
sponding to an autonomous system (7.7) in the plane. Suppose we are givena 
smooth curve L (see fooinote 5, p. 6) which has no self-intersections and no 
“points of contact with trajectories of (7.7),” 1.e., no points where L is tangent 
to a trajectory. For example, L might be a small segment of the normal toa 
trajectory, as in Sec. 54. Suppose the position of a variable point aéL is 
Specified by the value of a parameter 7. so that @ = a(z)." Drawing the tra- 
jectory of (7.7) passing through the point a(z,) in the direction of increasing 
lime, we continue the trajectory until its first intersection with L, if such an 
intersection occurs. This point of intersection corresponds to some parameter 
value + = 7, depending on <,, and leads to a function t, = U(%o) called the 
succession function, 

In general, 4(<) is not defined on the whole curve L, but only for values of 
> such that the continuations of the corresponding trajectories intersect L 
again. In fact, 4(+) may not be defined at any point of L at all. However, if 
(+) is defined for a given <p, then it must be defined and continuous for all= 
sufficienuly near zy. This follows from the theorem on continuous dependence 
of a solution on the initial data. and from the fact that the curve L has no 
points of contact with trajectories (so that any trajectory meeting L must 
Intersect L). 


The following theorem shows how the succession function can be used to 
find cycles: 


THEOREM. A necessary and sufficient condition for a cycle to pass 
through the point a(z») is that U(x») be defined and that &(%») = Fo 


Proof. The sufficiency of the condition is obvious. To prove the 
Necessity, suppose first that 4(zy) is not defined. Then the trajectory 
fz, Hever intersects L for z > 0 and in particular cannot pass through 


a “% “1 E " ~ a ‘ F 
See Pung Chin-chu, Positions of limit-cycles of the system 


dx \ ef y = 

i aes rb yek - Sei : of a 

eee” Gate 
ONirk 2 Gh gc fteecce 


scientia Sinica, 8, 151 (1959), 


fko* oe ‘ ee silos . E . ‘ . ep “ofa e we Mitr 
Phe letter ¢ is reserved for the time, ie., for the parameter of trajectories of the system 
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a(zq) again. Therefore /,,.., cannot be a cycle in this case. Thus suppose 
that +, = h(<y) Is defined but 7, 4 <q, and let a(zy) = ay, a(=,)) = ay. 
Then the are aga, of the trajectory 
/,, and the are a,ay of the curve L 
together form the boundary ofa “Ben- 
dixson pocket,” since L has no points 
of contact with trajectories and hence 
the trajectories crossing L all go In the 
same direction (why?). The are aj,dp 
of the curve L can be intersected by 


trajectories entering the pocket as in (a) 
Figure 33(a), or leaving it as in Figure FIGURE 35 


35(b). In either case, it is clear that the 

trajectory /,, whose continuation passes through the point @, can never 
approach ay, again. Therefore /, cannot be a cycle, and the theorem is 
proved, 


—— 


Remark. Suppose a cycle passes through the point a(<9), so that Y(%») = 
<9. Then the character of the stability of the cycle (recall Figure 34) depends on 
the behavior of U(+) near the value + = 7p. If the cycle is isolated (i.¢., fit has 
a neighborhood in which there are no other cycles), then U(+) = < for suffi- 
clently small |z — zyl > 0. It is easily verified that the cycle is stable, unstable 
or semistable, depending on whether 4(+) has the behavior shown in Figure 
36(a), 36(b) or 36(c). 


(b) 
FIGURE 36 


Problem J, Prove that 2(z) is an increasing function. 


Problem 2, Prove that if the system (7.7) has a stable limit cycle /, then, given 
any ¢ > 0, there exists a 8 > 0 such that any system obtained by changing the 
right-hand sides of (7.7) by less than 6 has at least one cycle in an e-neighborhood 
of /. Prove that if there are finitely many such cycles, then the number of stable 
cycles is one larger than the number of unstable cycles. How about the case 
where (7.7) has an unstable limit cycle? Prove that arbitrarily small changes in 
the right-hand sides of the system can lead to disappearance of a semistable 
cycle, 
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Problem 3, Examine the structure of a “sufficiently narrow” neighborhood 
of a nonisolated cycle. Prove thet the neighborhood is completely filled by 
cycles or parts of cycles if the right-hand sides of the system are analytic. 


Problem 4. Given an n-dimensional autonomous system, introduce the con- 
cept of a succession function for a smooth (7 — |)-dimensional surface which 
has no points of contact with trajectories. Establish the relation’ between this 
function and cycles. Assuming that the mapping defining the succession function 
has a principal linear part in a neighborhood of a fixed point, give sufficient 
conditions for stability of the corresponding cycle. 


56. Behavior Near a Critical Point 


Itis easy to see that a necessary and sufficient condition for a point x =» 
to be a critical point of the autonomous system (7.2) is that 
f(x’) =0 (7.8) 
[substitute x(7) = x° into (7.2)]. Writing (7.8) in component form, we obtain 
n equations in the 1 unknown coordinates of the critical point. In the case of 
a two-dimensional autonomous system (7.7), the coordinates of the critical 
point satisfy the equations 


AGr x2) =0, fal, x2) = 0. (7.9) 
Since (7.7) and (7.9) together imply 


dx2 _ fel X2) (7.10) 
AX, f,(%1, Xe) 
We see that a critical point of (7.7) is a singular point of (7.10) in the sense of 
Sec. 22. It follows from the considerations of Sec. 22 that the behavior of the 
trajectories near a critical point may be quite complicated. | 
We now study what happens in a neighborhood of a critical point, under 
rather general conditions. Specifically, we assume that 


1) The critical point is at the origin; 
% Pusls ae ey ~ ‘ ‘e , E Sa) ak ; . og f 
2) Each of the functions i, fz appearing in (7.7) can be represented as the 


sum of a homogeneous polynomial of degree m >> | and higher-order 
ferms, 1.., 


FAX, X2) =< POS: Xz) —— vi(Xd, Xa), j=: 1 2 (7.11) 


xis Xa) = o(lx,J™ + Lx.{™). 


i? 


* This can always be achieved by parallel displacement of the coordinate anes. 


For the meaning of the symbol o, see Prob. 6, See. 49, 
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[tis convenient to first transform to polar coordinates 9 and ¥, related to x, 
and x, by the formulas 


Vee 005-4. Ne = 9 Sin 9. 


be -_ 


Thea the system (7.7) becomes 


do lf dx dx, | . 
ia (x Paes =) = (xP) + XeP2) + 0(9"), 
dig dt dt 9 
/ \ 
d | dx., dx Es mate _ 
cf —(x —— — Xe =} == (xP. — x2P,) + 0(9"), 
df 9” dt dt/ 9” 
or 
do de a | oe 
Fi — 5"O(«) ae o(o”), . -_ 3” 'R() oe o(o™ ') (7.12) 
( é 


in terms of the functions 


O(2) = P,(cos 9, sin 9) cos o + P.(cos “9, sin g) sin 


R(®) = P.{cos 9, sin ©) cas 9 — P,(cos 4%, sin 9) sin 9. 


These functions are both homogeneous polynomials in cos g and sin 9 (of 
degree m -+ 1), and hence are periodic with period 27. 


1, First we consider the case where R(@) = 0(0 < 9 < 2%), say R(g) > 0. 
Then, according to the second of the equations (7.12), ina small neighborhood 
of the origin every trajectory moves around the origin in the positive direction 
as f increases. Moreover, it follows from (7.12) that 


= ; b 
peda  R(@) + o(1) 
and hence, integrating along any trajectory, we find that 
rxttx O(c) + o(1) 7, 
AEN ae. (7.14) 


In o(% -+ 27%) — In a(x) a Reveal) 


Suppose first that 


pal" 22 ae <0. (7.15) 


Integrating (7.13) between the limits « and % +- @ (0 < 9 < 2s), instead of 
between « and » 4- 2x, we find that the difference In e(¢ + 9) — In o(a) is 


tae big 


bounded in a small neighborhood U of the origin, and hence the ratio 


=, 


t ~ 
: aie Pn ed 
i sienna (0 ton, ? it 2 i =) 


is bounded both from above and from below by positive constants. Thus ipa 
lrajectory begins sufficiently near the origin, it remains in U after making a 
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complete circuit around the origin. But if the neighborhood U is small enough, 
the right-hand side of (7.14) will be arbitrarily close to I, say between $/ and 
27, Then (7.15) implies that In p > —co and hence o—>O0 as 9>+ 0. 
Hence, under these conditions, every trajectory beginning sufficiently near 
the critical point spirals toward the point as f-> + ©, winding around the 
point an infinite number of times in the positive direction. In this case, the 
critical point is called a stable focus (cf. Sec. 22). 

Similarly, if J > 0, it can be shown that the trajectories all spiral toward 
the critical point as ft» —©, i.e., the critical point is an unstable focus. If 
I = 0, it can only be shown (do so!) that if any trajectory / approaches the 
critical point as ¢—> +00 or as?» — oo, then /isa spiral trajectory and more- 
over so is every other trajectory beginning near the critical point (which is 
again a focus). However, if J = 0 it may happen that every neighborhood of 
the critical point contains closed trajectories surrounding the critical point. 
Then if all the trajectories are closed for sufficiently small p > 0, the critical 
point is a center (cf. Sec. 22). Otherwise the critical point is called a “center- 
focus,’’ but Poincaré has shown that a critical point of this kind is impossible 
if the functions /4(%,, X2) and fo(%1, ¥2) are analytic. Which case actually occurs, 
a center or a focus, can only be ascertained by studying the higher-order 
terms W, in (7.11). This complicated problem will not be gone into here. 


2. Next we consider the case where R(¢) takes values of both signs. Then 
R(@) has no more than m + | zeros for0 < ¢ < 7 (why ?), and shifting these 
zeros by 7 gives the zeros of R(o) for wm < 
© < 2r. In this case, every trajectory approach- 
ing the critical point as ¢ > -- 00 or asf > — % 
has a definite direction @ at the critical point, 


+ €/ 
is 0. MLL LLL where R(@) = 0, and hence there can be no 
4 € YAAUA more than 2m + 2 directions of approach to 
; the critical point. 
R>O 


To prove these assertions, we represent the 
trajectories in an auxiliary pp-plane (see Figure 


Pa traj 5 the critical 
P20. LLLLIE 37). If a trajectory l approaches the cri He | 
(pp — € KACARZEZEZ/ 5p point, then, starting from a certain time, / must 


lie in the strip 0 <p < 9, where og, 1s any pre- 
assigned positive number. Given any ¢ > 0, we 
draw the lines = ©, -- ©, where the ¢, denote 


Tope ae eae ee the zeros of R(o). If ge is small enough, the 
P; Wy trajectory can intersect each of the unshaded 
P,— € KLENALA rectangles shown in the figure in only onc 


direction, either upward or downward (why). 
On the other hand, integrating (7.13) from oy" 
ete a. tog, where p, +e < 9* <@ < Pa — & We gel 


% 
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an upper bound for the factor by which 9 changes as the point (¢, 9) traverses 
the unshaded rectangle IT. Then the second of the equations (7.12) implies that 
once having entered the rectangle IT, a trajectory must leave II within finite 
time, crossing the line ~ == 9. — =, and similarly for every other unshaded 
rectangle. But as 9 —» 0, a trajectory can intersect only a finite number of these 
rectangles (why ?). Therefore, starting from some sufficiently small value of 
o, a trajectory approaching the critical point must stay forever in one of the 
shaded rectangles, Since <« can be made arbitrarily small, this proves that 9 
approaches one of the 9, as 9 —> 0, 


3. Now let R(@) = 0, O(3) & 0, and suppose R(e) changes sign in passing 
through ©. In this case, at least one trajectory approaches the critical point 
along the direction @. Moreover, let S be an angular sector with vertex at the 
critical point and bisector 9 = %. Then every 
trajectory in S approaches the critical point along 
the direction @, if the radius and angle of S are 
sufficiently small and if R(o)Q(9) changes sign 
from ~— Lo + in passing through &. 

The proof of these assertions goes as follows: 
Itcan be assumed that O(4) < 0, since otherwise 
we need only replace ¢ by —?, which causes R 
and Q to go into —Rand — Q without affecting 
the sign of RO. Suppose R(2) changes sign from 
+ lo —, Then for sufficiently small < > 0 and 
fy > 0, the velocity field on the boundary of the Figure 38 
rectangle IT: Jo — S| << s,0 < 
38) points into Il if 9 > 0 (I corresponds to a sector in the x,x,-plane). 
This means that the trajectories beginning in [] cannot leave I as ¢ 
increases. But it follows from the first of the equations (7.12) that every 
such trajectory reaches any line ¢ = 0 (it 5 > 0 is sufficiently small) in 
finite time, and hence approaches the critical 
point as f—» +o, where, as shown above, the 
direction of approach Is 6. . 

On the other hand, if R(g) changes sign 
from — to -+, then, for p > 0, the velocity 
field on the boundary of the rectangle IT is 
directed as shown in Figure 39. Consider the 
trajectories beginning on the segment AB. 
According to the theorem on continuous de- 
pendence of the solution on the initial data, 


< 9 < pq (see Figure 


‘S». 
5, 
he 


if one of these trajectories leaves I] through the 
Figure 39 upper (or lower) side, then so does every other 
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trajectory beginning sufficiently near the given trajectory. Let C be the least 
upper t bound of all points of AB at which trajectories begin and subsequently 


leave I] through the lower side. Then the trajectory beginning at C can never 


leave TI (why not?). Therefore, by the previous argument, this trajectory 
approacl ies the critical point along the direction 6, and the proof of our asser- 
lions is now complete. 


4. Again let R(@) = 0, Q(8) 4 0, and suppose R(@) changes sign from 
— to + in passing through &. Suppose further that R'(&) + 0 and that the 
functions Y; appearing in the system (7.11) have continuous first derivatives 
with respect to X, and X, of order o(9""7) as 9 — 0. Then only one trajectory 
approaches the critical point along the direction &. To see this, we first write 
(7.13) in more detail as 
do p™ I R(~) + xyte — Xt, 


pot = El = Up, 9). (7.16) 


do  o™*1O(a) + x, + xobs 
A direct calculation shows that the right-hand side of (7.16) has a continuous 


derivative with respect to 9 in II (see Fig igure 38) if 9 > O, where this derivative 
can be made arbitrarily close to 


cs 
~ 
nm, 
ey 
age 


« 


(7.17) 
Q(9) 
if < and 9, are sufficiently small. Now suppose two distinct trajectories with 
equations 
o = 9,(9), Oo = (5) (7.18) 
approach the critical point along the direction 4. Substituting (7. 18) into 


(7.16) and subiracting the first of the resulting equations from the second, 
we obtain 


d( — 91) 


= U(e, %2) — U(s, 91). (7.19) 


ds 


According to Hadamard’s lemma, the right-hand side of (7.19) can be written 
in the form 


(S2 — DFC, Gis $2): 
where it clear from the proof of the lemma (see Sec. 20) that the function F 
is continuous in all its arguments on Il if 9 > O and 


: note OB) ‘ 
F( 8, 24. G2) -> O oe aS 9, —> G, 2 —* GP” 0+. 
(@) 
fer eG 


Thus (7.19) becomes 


U0, -- 9 | 
pA) (22 — iF le. eile). 92(9)] (7.20) 


ds 
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where the function A(@) is continuous and approaches the limit (7.17) as 
o->O-+. Integrating (7.20) as a homogencous linear equation (see Sec. 7), 
we find that 


. aD ar = Cexp | paw Ir . 


seo | 


Q2— 9 = Cexp | 
. Pp r 


But the last integral approaches -- «% as ¢ —- 0-+ (why?), and hence C must 
vanish if the left-hand side is to remain finite. In other words 9,(9) = %.(9), 
contrary to the assumption that the two solutions (7.18) are distinet. Therefore 
as asserted, only one trajectory approaches the critical point along the 
direction %, 


5. Finally let Rj) == 0, O(e) = 0, and suppose that each of the functions 
4, appearing in the system (7.11) can be represented as the sum of a homogeneous 
polynomial of degree im -- 1 and higher-order terms. Then at least one trajectory 
approaches the critical point along every direction 4 for which Q(2) = 0 
[there are no more than 2 -+ 2 directions for which ue = Oj. -in fact: 
under these conditions, dividing both sides of (7.16) by 9 leads to an ex- 
pression on the right which is continuous everywhere in HT, including the 
left side of Il. Then our assertion follows from the theorem on the existence 
of a solution of a differential equation with a continuous right-hand side, 
satisfying the initial conditions o = 0,9 = 3.4 


Example. We now apply these results to the most important case m7 = 1. 
Then the system (7.7) takes the form 
dx iiXs | 
—+ = ax, -+ bry -+ o{2), —= = ex, + dx. -+ o(s), Vit) 
dt df 
and the functions Q(¢), R(o) are the following homogeneous polynomials 
in cos 9 and sin 0: 
O(9) = (a cos g + Asin 9) cos 9 + (ec cos o + dsin 9) sing 
= a cos? 9 -+- (b + c) cos osing -+ dsin* 9, (7:22) 
R(g) = ccos* 9 + (d — a) cos g sing — bsin* 9. 
We shall assume ihat 
ad — he =: ( (7.23) 


since otherwise O() and R(g) have common zeros, a case which was not 
considered above. Note that the condition (7.23) was also imposed in See. 22. 


1 Suppose it is also known that the functions Yy, have continuous partial derivatives 
with ile to x, and x. of order O(2™) as ¢ — O. Then it ts noth ard to see that the deri- 
vative of the right-hand side of (7.16) is of order O(c). and hence only one integral curve 
can approach the critical point along the direction @. 

In fact, Oe) and Rig) have the common factor ecos 9 > dsing iffe) + Id) >> 0, 
and the common factor a cos @ -+ bsin pg ife = d= 0 
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If 
(d — a)? + 4bc < 0, 


then a direct calculation shows that 


Ore O(¢) b a — d 
—- do = —20 - ——=_=__—__ 
J R() i |b| /—(d — a)? — 4bc 


(verify this). Therefore the origin is a focus if a + d 0, and the integral 
curves wind around the origin in the positive direction if b(a + d) > 0 and 
in the negative direction if b(a -+ d) < 0. Since do/dt and b have opposite 
signs, we see that the focus is stable if a + d > 0 and unstable ifa +d<0 
(deduce this result from the theorem of Sec. 49). However, if a + d = 0, we 
can have a center, a focus or a center-focus, depending on the behavior of 
the nonlinear terms in the system (7.21). 
Next let 
(d — a)? + 4be > 0 (7.24) 


Then the function R(¢) with period m given by (7.22) has two zeros in the 
interval 0 < » < 7, and changes sign in passing through each of these zeros. 
Therefore the zeros of R(¢) in the interval 0 < » < 2x determine two pairs of 
opposite directions, along each of which at least one integral curve approaches 
the origin as tf > -+ 00 or as t—» — oo. To study this case in more detail, we 
note that (7.24) implies that the characteristic equation 


a—i b 
Cc d—i 


2 


1.€., 
2 — (a+ d)A — ad — be = 0, (7.29) 
has distinct real roots. Therefore, according to Sec. 48, there is a nonsingular 


linear transformation with real coefficients reducing the system (7.21) to the 
form 


= 1,8, -++ o(p*), —— > == hate 00"), (7.26) 


dé, dé 
dt dt 


where A, and A, are the roots of equation (7.25), and e* = J £2 1 £2 is the 
radial distance in the new ¢,5-plane. Instead of (7.22), we now have 
| O*(@*) = A, cos” 9* + Az sin? o*, 
R*(p*) = (Aq — Ay) COS o* sin o*, 
where ¢* is the polar angle in the €,5-plane. It follows from the second of 


these equations that the integral curves approach the origin along the 
coordinate axes. Here we must distinguish two subcases. If A,Ag > 0, Le., 


ad — be > 0, (7.27) 
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and if A, -+- Aw = a -+ d < 0. then domains of the kind shown in Figures 38 
and 39 alternate in the g*9*-plane, and hence the origin is a stable node 
(i.c., the motion along the trajectories is directed toward the critical point 
at the origin). If, in addition, the right-hand sides of the system (7.21) are 
continuously differentiable, then the conditions in italics on p. 180 are 
satisfied (why ?), and hence one integral curve approaches the origin along 
each of two opposite directions while every other integral curve sufficiently 
near the origin approaches the origin along one of the other two opposite 
directions (see Figure 12, p. 68). On the other hand, if (7.27) holds but 
a-- d >> OQ, then the origin is an unstable node. The other subcase ad — be < 0 
is treated in Problem | below. 
Finally suppose that 
(d — a)? + 4bce = 0. 


Then there are again two subcases. If [b] -- |e] > 0, the function A(z) =¢ 0 
does not change sign but has zeros. Although this case has not yet been 
considered, it follows from Problem 3 below that the origin is a node with 
two directions of approach if 


= OG"), i== 1,2 


a é 


for some < > 0 (see Figure 15, p. 69). On the other hand, if b= c¢ = 0, 
then ) 
aq=d, Rio) =0, Ofe) =a= 


It follows from our previous considerations that if each of the functions ¥y, 
can be represented as the sum of a homogeneous polynomial of degree 2 and 
higher-order terms, then the origin is a node which is approached by integral 
curves along every direction. 


Problem 1. Prove thatifad — be -< Oand if the right-hand sides of the system 
(7.21) are continuously differentiable, then the origin is a saddle point (see 
Figure 14, p. 69). 

Problem 2, Give an example of an autonomous system with a critical point 
of the center-focus type. Examine the structure of a neighborhood of a critical 
point of this type in the general case, 


Problem 3. Analyze the case where R(g) = 0 does not change sign but has 
zeros. In particular, assuming that vy, -- Oe). Sl 2 TOF some 20, 
R(3) 0, Q(%) 2 O and examining the direction of the velocity field on the 
lines go = & + so! where 0 ~ A cs, prove that infinitely many integral curves 
approach the origin along the direction g = Sas? -- + orasr~-- —&, More 
exactly, if for definiteness R(s) > 0, O() -. Qand if 9 is the first zero of R(2) 
for o -< & then for any 9%. 9 < gt ~. 9, there exists a p* > O such that every 
Integral curve beginning at QO - 6 <5 s*, 3 = 9* approaches the origin along 
the direction q@ = Bast -- + ow, 

Problem -4, Classity the isolated critical points of an autonomous system for 
the cause z= 3, assuming that the right-hand sides are linear and that the 


- 
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characteristic equation has distinct roots. Examine each of the five kinds of criti. 
cal points froma geometric point of view. Carry out an analogous classification 
for arbitrary 7° Which of the critical points are “crude,” 1e., do not change 


their type if the coefficients of the system are changed only slightly? 
P « ed = 


57. Theory of Indices 


Derinition 1. Let f(x) = f(4y. ¥2) be @ continuous vector field in 
the plane, with components f,\(Xy. Xz), PX. Ny), and let L be a piecewise 
smooth oriented Jordan curve’ passing through no critical points of 
f(x)28 Moreover, let 0 = O(x) be the angle which f(x) makes with a fixed 
direction 1, and ler AO be the change in 9 as the point A = (x,, Xe) traverses 
L once in the positive direction. Then the quantity 

AO 


I(L) = 7 


! 


™~ 


is called the index ef L with respect to f. 


Remark ]. The concept of index is due to Poincaré and can be used to study 
two-dimensional autonomous systems, in which case the components of f(x) 
are the right-hand sides of the system (7.7). However, the considerations 
that follow apply equally well to any continuous vector field in the plane. 


Remark 2. In calculating AQ, it ts 
assumed that we choose a definite branch 
of the multiple-valued function 0 = 0(y). 

i Thus the angle is a continuous function 
.T a } put there is no unique choice of the initial 


value of 0. For example, in the case of the 
\ vector field and curve shown in Figure 40, 
4 AQ = 2= and hence /,(L) = 1, regardless 
* 


SAN | of whether the value of the angle at the 
initial point Ay is taken to be 52/12, 


FiGure 40 2977/12, — 1972/12, ete. 


For a discussion of critical points of nonlinear autonomous systems for arbitrary # 
sce V. V. Nemytski and V. V. Stepanov, up. cit., Chap. 4. 

* £ is said to be Jordan if it has no self-intersections, and oriented if a definite direction 
along L is regarded as positive. We assume that L contains its end points, which coincide 
if L is closed. We shall always take the positive direction along a closed curve L to be the 
counterclockwise direction, | 

* By a erttical point of fs), we mean a point where fix) — f(x) = 0, just as on 
p. 176. | 

* Concerning this whole subject, see M. A. Krasnoselski, A. I. Perov, A. L. Povolotski 
and P. P. Zabreiko, Vector Fields in the Plane (in Russian), Gos. Izd. Fiz.-Mat. Lit., 
Moscow (1963), 
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Remark 3. \t is easy to sec that /,(L) does not depend on the direction /, 

nor on the initial point A, if L is closed. 

Next we list some simple properties of J,(L): 


|. Obviously, if —£ denotes L traversed in the opposite direction, then 


I({—L) = —1(L). (7.28) 
Moreover, if L= L, +--+ + L,,% then 
IL) = Ly) 0 + (Ly). (7.29) 


If f, and f, are continuously differentiable, then 


a [ fi Ufo — fe dh, 
ee fa fi 


+ (Hi) a) 


OX» OXe 


Co 


are cs 


Sa | atl (nZ _ Ly) ax, 
— L neon 
2k f, Ty 1 -:- ' Be a ON: 


3. If Lis closed, then /,(Z) is an integer. In fact, since the initial point and 
final pomt of ZL coincide, the vector field f(x) returns to its original 
value after traversing ZL once, ie., AO = 2 where m1 is an integer. 


IAL) = - [ 6 ~_ [ d(arc tan#?] = 
2 an" ie 


4. If L is closed and if Z is continuously deformed into another closed 
curve L’ without ever passing through a critical point of f(x), then 


I({L') = L(L). 


In fact, under such a deformation, both L and f(x) vary continuously, 
and hence so does {(L). But then /,(L) cannot change at all, since, as 
just shown, [,(L) = 2x. 


tay 


_ ff Z is closed and if there are no critical points of f(x) inside LZ, then 
[({L) = 0. To see this, we deform ZL into an arbitrarily small circle 
L’ inside L, with center a. Then, according to property 4, /,(L) = [,(L’). 
But /,(L’) = 0, since the values of f(x) on L’ are uniformly close to its 
value ata, and f(a) = 0 [think this through]. 


THEOREM |. Jf L is a closed trajectory of the two-dimensional 
autonomous system (7,7), then [,(L) = | 


Proof. In this case, the components of f(x) are given by the right- 
hand sides of (7.7). The curve L is a smooth Jordan curve (why ?). and 
moreover f(x) is tangent to L and vanishes nowhere on L. Whether or 
not the positive direction along L corresponds to the direction of 


"In writing £, +++ + £,, itis assumed that the final point pik = hate 
coincides with the initial point of Ly.) 


ni 
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increasing ¢ [and hence to the direction of f(x) itself], f(x) makes one 
turn in the counterclockwise direction as L is traversed in the positive 
direction (recall footnote 17)."! Therefore /,(L) = 1, as asserted. 


Corottary. Every closed trajectory of the system (7.7) contains at 
least one critical point of the system. 


Proof. Use property 5. 


DEFINITION 2. Let x° be critical point of f(x), and let L be any closed 
Jordan curve containing x° aud no other critical points.* Then the quantity 


T(x?) = IL) 


is called the index of x° with respect to f° 


THEOREM 2. Jf L is a closed Jordan curve passing through no critical 
points of the system (7.7), and if there are only finitely many critical 
points x*,...,X" inside L, then 


[(L) => 1L,(x). (7.30) 
k=l 


Proof. Draw extra curves inside L dividing the interior of L into 
subdomains, each containing only one 
critical point (sce Figure 41). This gives 
n closed Jordan curves £,,...,L,,; 
where, as always, the positive direction 
of traversing each L, is the counter- 
clockwise direction. It follows from 
(7.28) and (7.29) that 


LL) = 1 (Ls) 


no 


Ficure 4] 


(why ?). But this, together with Definition 2, implies (7.30). 


Problem 1, Let P(z) be a polynomial in z = x + iy of degree at least I. 
Writing P = Q + iR and examining /,(L), where f is the vector field (Q, 8) 
and is a sufficiently large circle with center at the origin, prove that P(z) has at 
least one zero. 


Problem 2, Prove that a node, a center or a focus has index |, while a saddle 
point has index —1. Prove that if L is a closed trajectory containing nodes, 


24 a es * esd F . * eos . J Pa z r 
For a rigorous proof of this intuitive geometric fact, see ¢.g., 5. Lefschetz, op. Cif. 
p. 199. | 
** This presupposes that x® is an isolated point. 
a ie Poe ren Pee i os eo: bat peseealt 3 
In making this definition, we tacitly rely upon property 4. 
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centers, foci and saddle points, then the total number of critical points inside L 
is odd and the number of saddle points is one less than the total number of all 
other kinds of critical points. 


Problem 3. [t can be proved that an isolated critical point x°, which is nota 
center or a focus, has a sufficiently small neighborhood -4° with the following 
structure: 4” contains a finite number of “elliptic sectors” completely filled by 
trajectories with x° at both ends, and a finite number of ‘hyperbolic sectors”’ 
completely filled by trajectories with the boundary of.4” at both ends. Both kinds 
of domains extend from x° to the boundary of -{° and are separated from cach 
other by “fans” completely filled by trajectories with one end at x? and the other 
end on the boundary of .#. If the number of domains of each kind is known, 
what is the index of the critical point? 


Problem 4. The concept of the index of a curve with respect to a vector 
field can be extended to the case of » dimensions. Let $ be a closed piecewise 
smooth oriented! (1 — 1)-dimensional surface, or a piece of such a surface 
satisfying similar requirements. Let 


fo) = VG), . 5 fale) 


be a continuously differentiable vector function defined in a neighborhood of 
S. Then by the index of S with respect to fis meant the quantity 


A A , Rh wh 


ao i b Se 2 . 
ON i ex i—} Ox i+] ax, 


vw 


[ eS ee ee ee ee ee ee ee ee cos (v, x xf] dS, 


Nw] + ) rl = 


L,(S) = 


ty 


p=] 


ran os 8 . bide n tear. e * . 
OX, OX; OXi.1 OX, 


where o,_, is the “volume” of the unit Gr — 1)-dimensional sphere, and ~ Is 
the exterior normal to S. Give examples illustrating the concept of index in three 
dimensions, making the appropriate geometric interpretation. Prove some 
properties of /,(S). 


Problem 5. Define the index of an isolated critical point of an n-dimensional 
autonomous system. Prove that the index equals +1 if the right-hand sides of 
the system are linear (what determines the sign?). What are the implications 
for the index of a critical point for a nonlinear system with a linear principal 
part? 


Problem 6. Consider a continuously differentiable field of tangent vectors 
on a three-dimensional sphere R, and a corresponding autonomous system on 
R. Define the concept of the index of a critical point, and prove that if there are 
finitely many critical points, then the sum of their indices equals 2. Use this 
and a suitable approximation to prove that every continuous field of tangent 
vectors on R has at least one null vector. Also study the case where K Is the 
surface of a torus or a “pretzel’’ (a sphere with two handles). 


“Pe. with the outside indicated. 
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58. Brouwer's Fixed Point Theorem 


We begin with a few general considerations, Two n-dimensional point 
sets K and K’ are said to be homeomorphic (to each other) if there exists.a 
continuous mapping 

xis: o(x)eK' (veXK) 


of K onto A’ such that the inverse mapping x = @7!(x") maps XK‘ continuously 
onto K.25 For example, the n-dimensional ball (solid sphere) is homeo- 
morphic to the a-dimensional cube or to the n-dimensional tetrahedron (or 
even to any convex n-dimensional body), but not to the set consisting of two 
disjoint balls. The properties common to all homeomorphic sets are studied 
in a special branch of mathematics called topology. From the topological 
point of view, homeomorphic sets are equivalent, just as congruent figures 
are regarded as equivalent in elementary geometry. Thus such concepts as 
angle, length, etc. are not topological. On the other hand, such “cruder’” 
properties as connectivity, dimension, etc., are topological (although the 
proofs are often far from easy). 
In proving the main result of this section, we shall need the following 


LeMMA (Sperner’s lenuna). Suppose a given triangle ABC is divided 
into a finite number of subtriangles such that any two subtriangles are 
either disjoint or else have a vertex or a side in common. Moreover, 
suppose the vertices of the subtriangles are all denoted by the letters 
A, B, and C in such a way that the letter A never appears on the side BC 
of the original triangle, the letter B never appears on the side AC and the 
letter C never appears on the side AB. Then there is at least one sub- 
triangle whose vertices are denoted by different letters. 


Proof. First we note that if an interval AB is divided into a finite 
number of subintervals, where cach point of division is denoted by 
either A or B, then the number of subintervals with end points denoted 
by different letters is odd. In fact, let p., be the number of subintervals 
with both end points denoted by A, let py, be the number of subintervals 
with one end point denoted by A and the other by B, and let pz be 
the number of points of division denoted by A. Examining each sub- 
interval in turn. independently of the others, we find that the end 
point A appears a number of times equal to 2p44 ~+ Pay (why 2). But 
this number must also equal 2p; +- 1, since each “interior” end point 
A is counted twice. Therefore 


2Paa + Paw = pi, + 1, 
and hence p’, is odd. 


© Such a mapping ts said to be ene-to-one and continuous in both directions. 
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Turning to the case of the triangle ABC, let p44, be the number of 
subtriangles with vertices A, A and B (in any order), and similarly 
for Pann and page. Moreover, let py, be the number of sides of 
subtriangles with end points A and B which lie inside the original 
triangle, and Ict pi, be the number of sides of subtriangles with end 
points A and B which lie on the boundary of the orieinat triangle. 
Examining the sides of cach subtriangle in turn, independently of the 
others, we find that the side AB appears a number of times equal to 
2Paanb 2Papp + Pane (why?). This number must also equal 2p} p + 
Pigs Since each “interior” side AB is counted twice, and therefore 


2Paan + 2Pasy + Pane = 2Pap > Plain. (7.31) 
But “boundary” oo AB can only he on the side, AB of the original 
triangle, and hence pf, is odd, by the argument given at the beginning 
of the proof. [t follows from (7.31) fat pac! is also odd. Thus pipe > 1, 
and the proof is complete. 


We are now in a position to prove a theorem widely used in the theory 
of differential equations and other branches of mathematics: 


THEOREM (Brouwer’s fixed point theorem)."* Let f(x) be a continuous 
mapping of an n-dimensional ball K into itself (a > 1). Then f(x) carries 
at least one point a & K into itself, t.e., f(a) = a. 


Proof. If a == 1, the theorem is an immediate consequence of the 
intermediate value theorem for continuous functions (why?). We now 
give the proof for = 2, where K is a disk, leaving the case of arbitrary a 
to the problems. 

First we note that if Brouwer’s theorem holds for any set K’ homeo- 
morphic to K, then it also holds for K. In fact, let f be a continuous 
mapping of K into itself and let 9 be a homeomorphic mapping of A’ 
onto K, Then g7!/a is a continuous mapping of A’ into itself, and hence 
has a fixed point a by hypothesis: 


a flolay} = a. 
But then 
/[ela)] = 9a), 


L.e., 9(@) is a fixed point of the mapping f. | 
This observation allows us to replace the disk K by a triangle ABC, 
seed by a continuous map ping of ABC into itself, Given any = > Q, 
we “triangulate’? ABC, ie., we divide ABC into subtriangles as in 
*6 Proved by the Dutch mathematician L. E. J. Brouwer in LOLO. An equivalent asseruon 
was proved and applied to the theory of differential equations by the Latvian mathematician 
P.G. Bohl in 1904, 
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Sperner’s lemma, all with sides of length less than ¢,. Suppose f carries 
no point of ABC into itself: Then we label the vertices of the subtriangles 
according to the following rule: A vertex is denoted by 4 if f carries it 
nearer to the side BC of the original triangle, by B if f carries-it nearer 
to the side AC, and by C if f carries it nearer to the side BC. If f carries 
a vertex nearer to two sides of the original triangle, say BC and AC, 
the vertex can be denoted by either A or B. Then it is easy to see that this 

> satisfies the conditions of Sperner’s lemma, and hence there is at 
least one subtriangle, say (4 BC),, whose vertices are denoted by different 


Now let ¢,,....¢,;--- be any sequence of positive numbers 
converging to zero, and carry out the above construction for every m1. 
By going over to subsequences, we can assume without loss of generality 
that the corresponding triangles (4 BC),, converge to a point a (why?). 
Since a is not fixed under the mapping /{ It must move away from one 
of the sides, say BC, of the original triangle. But then every point of the 
triangle (4BC),, moves away from BC if n is sufficiently large. This 
contradicts the construction of (4BC),,, thereby proving the theorem. 


Problem 1, Generalize Sperner’s lemma to 7 dimensions (proving that there 
is an odd number of subtetrahedra with appropriately labelled vertices), and then 
use mathematical induction to prove Brouwer’s theorem for arbitrary x. 


Problem 2. Prove Brouwer’s theorem for arbitrary 2 by the following alter- 
native method: First use the thoery of Sec. 57 to prove the theorem for sufii- 
ciently smooth mappings, and then approximate an arbitrary continuous 
mapping by smooth mappings. 


Problem 3. Use Brouwer’s theorem to show that no closed bounded u- 
dimensional set with interior points can be continuously mapped onto its 
boundary in such a way that every boundary point is carried into itself. 


59. Applications of Brouwer’s Theorem 


First we generalize the corollary to Theorem 1, p. 186 on critical points 
inside a closed trajectory: 


THEOREM 1. Given an n-dimensional autonomous system (7.2) and @ 
set K homeomorphic to an n-dimensional ball, suppose every trajectory 
of (7.2) beginning in K at 1-=0 stays inside K for all t > 0. Then kK 
contains at least one critical point of the system. 


arbitrary point x° € K into the point x(<,; x°). By hypothesis, there is a 
continuous mapping of K into itself, and hence, by Brouwer’s theorem, 


Proof. Given any =, > 0, consider the mapping carrying an 
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there isa point! € A such that x(7, ix!) = x1, i.e., a trajectory beginning 
at. x) returns to NX? alter a time: <5. Similarly Se 25 43 as 15 any 
sequence of positive numbers converging to zero, there is a point 
x” eK such that x(7,,3 4") = x". By going over to subsequences, we 
can assume without loss of generality that the sequence {x} converges 
toa point XE K, 

We now show that ¥ is a critical point, thereby completing the proof. 


Clearly 
o{[ 4 enue") =a 132 
n 


for any fixed ¢. The left-hand side of (7.32) approaches x(t; %) as m1 —- a 
(why ?), while the right-hand side appreaches ¥. Therefore 


A) ee 
for all f, 1.e., ¥ 1s a critical point, as asserted. 
Next we prove a result related to Theorem 2, p. 171 on cycles:* 


THEOREM 2. Given a three-dimensional autonomous system (7.2), 
let T be a torus such that every trajectory of (7.2) beginning inside T at 
t=0 stays inside T for all t > 0. Moreover, suppose every point x 
moving along a trajectory in T, as 
described by the system (7.2), has 
positive angular velocity about the 
axis of T.® Then T contains at least 
one cycle. 


Proof. Let the disk K be a 
cross section of 7, as in Figure 42, 
and let /.o be a trajectory beginning 
at a point x°€ K. By hypothesis, 
[io Stays inside T as ¢ increases. The Pico 
angular velocity of the pomts mov- 
ing about the axis of 7. being positive and continuous, has a positive 
minimum. Therefore, after a finite time, (jo intersects AK again in a 
point x! depending on x°: 

Aer ea 


** ‘The reader should try to generalize Theorem 2, which is a rather special result in three 
dimensions (chosen only to illustrate the technique of applying Brouwer's theorem). 

If the x-axis is chosen as the axis of the torus, the requirement on the angular 
velocity takes the form 


Xe dx; 


deo dt 


a My fp XaXy) — Xs fila Xp Xa) > 0. 
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ne function © is a mapping of K into ee playing the role of a 
succession function (see Sec. 55). Moreover, @ Is continuous, because of 
the continuous dependence of the solution on the initial conditions. It 
follows from Brouwer's theorem that there is a point ¥¢€ X such that 
o(¥) = ¥. Thus there is a cycle passing through ¥, and the proof is 
complete. 


Problem 1, Let the set K, with boundary ID, be homeomorphic to an ne 
dimensional ball, and suppose that for every point aéT there is an n-dimen- 
sional right circular cone P, (including its base) with vertex at @ such that 

1) P, is completely contained in K; 

2) P.,, depends eon miceely on a; 

3) The cones P,, (2 € VP) are all congruent. 


Moreover, given an #-dimensional autonomous system (7.2), suppose the vector 
f(a), app lied at an arbitrary point a € P, is never directed into Py. Prove that K 
contains at least one critical point. In particular, use this fact to prove that if 
Kis bounded by a smooth surface on which the velocity field is never directed 
along the exterior normal, then X contains at least one critical point. 


Problem 2, Use Brouwer’s theorem to find sufficient conditions for the 
existence of a periodic solution of the general system 


dy; : 
ae == fil Ji me 38 als i ries fs 


if the functions f; are all periodic in x with the same period. 


Supplement 


FIRST-ORDER PARTIAL 
DIFFERENTIAL EQUATIONS 


This supplement 1s devoted to the theory of first-order partial differential 
equations involving one unknown function. A basic feature of the theory is 
that all solutions of such equations reduce to integration of ordinary 
differential equations. Just how this reduction is accomplished will be 
described below. 


60. Semilinear Equations 


We begin by considering equations of the form 


6 Ou 
© O(Nins6 bak oe ee Oo a) = 0, (1) 
fev] ON; 


where it is assumed that 


1) The coefficients a,(x,....,X,) have continuous first derivatives with 
respect to all their arguments on some domain G in (xy,..-+¥x) 
space ; 


2) The inequality 


holds on G; 
3) The function b(x,,....%,.%) has continuous first derivatives with 
respect to all its arguments for (x,,...,¥,) € @ and Jul <M, 
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In particular, condition 3 holds if b(Xy,....X,, 0) 1s a lmear function of x 
with coefficients which have continuous first derivatives with respect to all 
the x, and in this case equation (1) ts called /inear. In general, the unknown 
function x appears in O(x,......¥,, 4) ina nonlmear way, and then equation 
(1) is said to be semilinear. 

Now consider the system of ordinary differential equations 


dx, 12 Oa aaa et 


; ._ eerere | (2) 
ds 


nt 

DUO me why) 

Je] 

Because of conditions | and 2, the right-hand sides of (2) have continuous 
first derivatives with respect to ail the x; Hence there is one and only one 
integral curve of the system (2) passing through each point of G (the parameter 
s equals the arc length along the integral curve). These integral curves are 
called characteristic curves of equation (1). 


THEOREM | (Uniqueness theorem). If the function u(x,,...,%,) 
has continuous first derivatives and satisfies equation (1) on a domain G, 
then all the values of u(x,,....X,) along an are of a characteristic curve 
H, where |u| <M, are uniquely determined by its value at any point 
(xi... 5X9) of the arc. 


Proof. Dividing both sides of (1) by Va? +--+ + a@® and taking 
account of (2), we obtain 


S a, due b 
i=] eae 23+ | fat : a 
Vaitostaiax, Vale ta (3) 
du dx; 
a a 
. eo" . o +, aA 
10x, ds y, ils eas ok’ OME OC Va? setae a 
where the replacement of 
* Ou dx; 
=10x, ds 


by du/ds is justified by the assumed continuity of the first derivatives of 
uw.) Let H be a characteristic curve passing through a point (a4, ..-» Xn) 
of G, and suppose |u(x?, ,X°)| < M. Then 


5 Ses, 3°) PS Noe wl (4) 


along H, where +, and its first derivatives are continuous in s and all 
the x? Substituting (4) into 


ert te, - 


‘ (ORE te ems Aer i reer 3 


VQz ch oct Pe a 


‘See e.g., D. V. Widder, Advanced Calculus, second edition, Prentice-Hall, Inc., 
Englewood Cliffs, N.J. (1961), 55: 
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we find that w satisfies the differential equation 


du | : 
—— oo b(s, u, Xs Saige hg x? (5) 


ds 
along //, where © has continuous first derivatives with respect to all its 
arguments If [ul < Af (cf. Sec. 21). Therefore the value of u at any point 
ofan arc of HT where |u| < M is uniquely determined by its value at any 
point of the arc, in particular at the point (x°,..., x°). 


THEOREM 2 (Existence of a solution of the Cauchy problem). Let S 
be an (n — 1)-dimensional surface lying in the domain G* and suppose S 
has a continuously turning tangent plane. Suppose further that S is not 
tangent to any of the characteristic curves of equation (1). Let f be any 
function defined on S such that 

1) The absolute value of f is less than M; 

2) Each point of S has a neighborhood in which f can be represented 
as a function of n — \ of the coordinates x,,...,X,, with con- 
tinuous first derivatives with respect to these coordinates. 

Finally suppose that the surface S has a neighborhood Ry such that 

1) Ro is contained in G; 

2) A characteristic curve passing through any point of the surface S 
does not intersect S again when continued in both directions inside 
Ry, aud only one are of a characteristic curve passes through each 
point of Ry; 

3) Given any point (xo, ..., x9) of the surface S, the solution of equation 
(3) satisfying the initial condition u(O) = f(x{,.... x2) can be 
continued along the entire are of the characteristic curve inside Ro, 
without exceeding M in absolute values 

Then there exists a function u(xy,....%,) defined on Ro such that 


1) w has continuous partial derivatives with respect to all the x;; 
2) wis an “integral surface.” i.e., u satisfies equation (1); 
3) uv satisfies the “initial data,” i.e., u coincides with f on S. 


Proof. By solving the Cauchy problem for equation (1), we mean 
finding a function u(x,,....,,) with the above properties. For simplicity, 
we confine ourselves to the case of two independent variables (171 = 2), 
leaving it to the reader to extend the proof to the case 7 > 2. To avoid 
the use of subscripts, we write equation (1) fora = 2 inthe form 


a(x. vy) = - b(x, vy) = + e(x, y, 2) = 9. (6) 
Ox Oy 
*S can be either closed or open; in the latter case, S is not regarded as including tts 
boundary. 
* It is assumed that s = 0 on S. 
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Correspondingly, equations (2)-(5) become 


dx a(x. ¥) dy b(x, y) (7) 

ds Vat + bt ds /a* -- > 
Ae CRS 2) 
— + os = 9, (8) 
ds Var -+ b* 

x= o5,X 9), = Us, x°, 9°), (9) 

az re 
one VCE, ar): (10) 
ds 


The “surface’’ S on which fis defined is now a curve in the xy-plane. 
Thus, i this case, the Cauchy problem has the following geometric 
interpretation: Find an integral surface z = z(x, y) satisfying equation 
(6) and passing through a given spatial 
curve S whose projection onto the xy-plane 
is S (see Figure 43). Since the value of 
= along every characteristic curve H is 
uniquely determined by the value of z at 
the pointin which # intersects S, we should 
be able to find z(x, ¥’) by examining all such 
characteristic curves. 

We now pursue this idea in more detail. 
On every characteristic curve H intersecting 
Sina point dy, we determine the function 
z which satisfies equation (10) and takes the 

FiGuRE 43 same value at Ay as the given function /- In 
general, the function z cannot be determined 
in this way on the whole characteristic curve H, since in continuing zZ 
along H we may leave the domain of values of = where c(x, y, z) is defined 
Or we may intersect S twice. However, by hypothesis, we can determine 
= ona whole neighborhood R, of S. Thus all that remains is to prove 
that the function z(x, }) so constructed has continuous derivatives with 
respect to x and y. Then the relation 


erence emis ee 
anon eer semetiot 


Ox ds  @yds — ds 


will hold on R,, and hence z will salisfy not only equation (10), but 
equations (8) and (6) as well. 

Before proving the existence and uniqueness of the derivatives of 2 
with respect to x and yo at any point A = (x, vy) of the domain Ry, we 
introduce new curvilinear coordinates in a neighborhood of A as follows. 
Suppose the characteristic curve H drawn through A intersects S in the 
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point Ay = (x", y°). To be explicit, we assume that the tangent at A, to 

the curve S 1s not parallel to the y-axis (note that the argument given 

below remains in force if the roles of the x and y-axes are reversed). 

Then the arc of S near A, can be represented by an equation of the form 
y? — F(x°), 


where the function / has a continuous derivative. On the other hand, 
since the functions ¢9(s, x°, y°) and (s, x°, y®) have continuous deriv- 
atives with respect to s, x°, and y® (cf. Sec. 29), the functions 


ofs, x°, F(x°)] = o(s, x°), U[s, x%, F(x%)] = Us, x°) 
have continuous derivatives with respect to s and x°. Let s and x® be the 
new coordinates. Then, as we now show, the Jacobian of the functions 
x = o(s, x°), vy = Us, x°) is nonvanishing. In fact, these functions 
satisfy the system of ordinary differential equations (7), which for brevity 
we write as 


Substituting ® and } for x and y, we obtain the following identities in 
Sand x: . . 
do(s, x) 


ad Oe 4 
(P[a(s, x°), U(s, x°)], 
ds 
7 (11) 
db(s, x”) cd, pe eee ai 
DU = lols, x), Us, x )). 
ds 
Since the functions 6, J, ®, and ¥ have continuous derivatives with 
respect to all their arguments, the right-hand sides of the identities (11) 
have continuous derivatives with respect to s and x°, and hence the same 
is true of the left-hand sides. Therefore, differentiating both sides of (11) 
with respect to s, x°, and setting 


de ~ au ; 00 ~ au r 
—+—= Do, — = Dy T= Dio; = 4 Dy, 
ds ds Ox Ox 


we obtain? 


iD @F .. oF .- 
Qe peep, Wee), 
ds ce) ay 


‘ Here we rely on the following theorem (see ¢.g., T. M. Apostol, op. cif» P- 121): 
Given a function {defined on a domain G in the xy-plane, suppose the partial Saari 
fx f, and f, exist and are continuous on G. Then fy. exists everywhere In G, and is equal to 


rye 
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Since the coefficients 6/dg,..., O/dy do not depend on p, the 
PincuOAs Dw and D, satisfy the same system of linear differential 
equations for both p = 0 and p = I. Therefore the Jacobian 


ax” as | 
HV’ = 2 


ab du 

Gx” as 
which is a Wronskian. is nonzero on the whole are H if and only if it 
is nonzero at point Ay where the characteristic curve intersects S (cf. 
Sec. 33). But at this point 


; 
, as} _ a) dF 28 
| dF aul as dx’ as’ 


dx as | 
Where the expression on the right is the cosine of the angle between the 
normal at A, to the curve S and the tangent to the characteristic curve 
H at the same point Ay. multiplied by a nonzero factor (why ?). By hy- 
pothesis, this cosine is nonzero, and hence the Jacobian HW is nonzero 
everywhere on f/. 
Therefore, by the implicit function theorem, the system of equations 


Slee. jp tese) 


can be solved for s and x° in a neighborhood of H. Moreover, since 9 
and 1 have continuous derivatives with respect to s and x°, the implicitly 
defined functions s and x° will have continuous derivatives with respect 
to x and y.° Hence to prove that the function z constructed earlier on the 
domain Ry has continuous derivatives with respect to x and y on Rp, 
we need only show that if z is regarded as a function of s and x° ina 
neighborhood of H, then it has continuous derivatives with respect to 
sand x®. But this can be seen as follows: Substituting F(x°) for y? in 
the right-hand side of equation (10), we find that the function z satisfies 
an equation of the form 

es TS Syke), 

ds 
Where the function y has continuous derivatives with respect to all its 
arguments. Moreover, by hypothesis, the function f giving the initial 


* See c.g., T. M. Apostol, op. cit., p. 147. 
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values of the function z for s = 0 (on the curve S) has a continuous 
derivative with respect to x°. Therefore, using the theorem of Sec. 21, 
we find that z has continuous derivatives with respect to s and x° ina 
neighborhood of H, thereby completing the proof of Theorem 2. 


Remark 1, Suppose the surface S and the function f defined on S satisfy 
the first two conditions of Theorem 2. Then it is easy to verify the existence 
of a neighborhood AR, satisfying the conditions of the theorem. In fact, we 
need only choose the set Ry formed from sufficiently small arcs of character- 
istic curves passing through points of S. This guarantees the existence of a 
solution of the Cauchy problem in a sufficiently “thin” neighborhood of S. 


Remark 2. Unless it is assumed that the functions a, and b have con- 
tinuous derivatives, equation (1) may not have a solution with continuous 
derivatives. For example, following N. M. Gyunter, consider the equation 


dz . dz . 
= + — = b(x — 9), (12) 
Ox ay 

where b(w) is Weierstrass’ continuous nowhere differentiable function.? We 


Introduce new variables uv, v and w by setting 
X-+yp=sv, x— pow, 2x, p) = ule, w). 


Suppose that on some domain G of the xy-plane there exists a solution 
2(x, ') of equation (12) which has continuous derivatives with respect to x 
and y.* In terms of the new variables, (12) becomes 


The solutions of this equation are all given by the formula 


| 
u(u, Ww) = . b(w)e + chy). 


Where c(i) is an arbitrary function of i, Le., 


ta? 
eed 


l 
eM) Se Ne a= y) + e(x — ¥). (1 


But it is easy to see that there is no domain in the xv- plane where the functio 


_~ 


= given by (13) can have derivatives with respect to x and y. In fact, if see 


* See eg. E. C. Titchmarsh, The Theory of Functions, second edition, Oxford University 

Press, London (1939), p. 351. | 

* By a sofaion of (12) is meant a function 2(x, ¥) with partial derivatives 6z/éx and 
d2/0) everywhere in G which satisfy (12). 
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such derivatives at the points (x, ) and (x +- 2, "++ ¢), then so would the 
function 
z(x + 6,» -+ €) — 2(x, )) = eb(x — y), 


which is impossible. In other words, z(x, 1") cannot satisfy equation (12 
contrary to our original assumption. 

It can be shown® that every continwous solution of equation (12) is of 
the form (13), even if it is not required that the solution have continuous 
derivatives. Thus we arrive at the conclusion that there is no domain in 
which equation (12) has a continuous solution. 


Problem I. Prove that if the iriual data are specified on a characteristic 
curve, then equation (6) either has no solution or all or else has infinitely many 
solutions. When does each case occur? 


Problem 2. Show that if 2 = 2 and if the domain is simply connected, then 
the solution of a linear equation can be continued onto any domain consisting 
of points of characteristic curves drawn through S. Give an example showing 
that this is not always possible if the domain ts not simply connected. 

Comment. If mn > 2, the continuation is not always possible even for a 
simply connected domain. 


Problem 3. The theory becomes much simpler for generalized solutions of 
equauon (1), by which we mean here continuous functions satisfying equation 
(3) along every characteristic curve. Consider generalized solutions of the 
equation 


dz 
cease 


Ox 
in some domain & of the xy-plane, if the intitial data are specified on a piecewise 
smooth curve S contained in R. Find conditions for the existence of a (gener- 
alized) solution of the Cauchy problem, and conditions for the existence of a 
unique solution. Find conditions for the null solution to be the only solution 
corresponding to zero initial data. (How does this requirement differ from the 
preceding one?) Can we always infer the uniqueness of the solution in a smaller 
domain & from uniqueness of the solution in a larger domain containing R? 
Try to extend these results to the equation 

se 

ale, = = + dt, Ys =Q (a? + 6 > 0), 


where @ and 6 are sodieans digereniiable functions. Give an example of 
such an equation in a domain R, whose solutions are all constants. 


Problem ¢. Give examples of cases where several characteristic curves go 
through certain points. Consider the second equation of Prab. 3 in the half- 
plane 0 < x < & with continuous Cauchy data on the x-axis, where a and 4 are 
continuous functions, @ + Oand itis assumed for simplicity that sup ibjal - 


“R. Baire, Sur les fonctions de variables réelles, Annali di Mat. (3), 3, 1 (1899). 
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Show that the (generalized) solution, if it exists, must be unique. Show that a 
solution exists if the characteristic curves can be uniquely continued in the 
direction of decreasing x, but otherwise a solution may not exist. What is a 
necessary and sufficient condition for existence of a solution of the Cauchy 
problem ? What new features appear if b/a is not required to be bounded? 


61. First Integrals of a System of Ordinary 
Differential Equations 
According to the preceding section, the system consisting of the ordinary 
differential equations (2) and the equation 
dt b 
bar meresas are ee 9) 


ds Vay ee 


(where s is a parameter) defines a family of integral curves in (xy, ~~.) Xn. ¥) 
space, Which make up integral surfaces 


| Conner ee 


of equation (1). These ordinary differential equations can be written in the 
symmetric form 


dx dX ie: du - 
ty iy a, Dd 


It is sometimes easy to find functions 

Gl Rei ceu el) 
which are not identically constant and have constant values along every 
integral curve of the system (14). Such functions are called first integrals 
of the system (14). 

For example, consider equation (6) involving two independent variables. 

In this case, the family of integral curves making up the integral surfaces 1s 
determined by the system 

ON OY MS, (15) 

a b —¢ 
Suppose we have managed to find two first integrals 

o(x.y,c) bn 5) 


of the system (15) such that at least one minor of order two of the matrix 
Go Go G9 


Ox ay Gz 


au ab ay 


Ox Oy dz] 
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is nonzero at every point of a domain G, in (yx, . 2) space." Then the system 
of equations 


o(x, y=) == (x, 7 29), vv ez) = YOY, 2°) (16) 


determines a curve L in the domain G,, since (16) determines two coordinates 
as functions of the third in a neighborhood of every point where both 
equations are satisfied. This curve will in general consist of several disjoint 
pieces, However, we shall assume that in the domain G, under consideration 
every curve determined by the system (16) consists of only one piece. By the 
definition of a frst integral, the function o(x, ¥, 2) has the constant value 
o(x°, y°, 2°) along the integral curve ZL of the system (15) passing through the 
point (x°, y®, 2°), and similarly U(x. y, 2) has the constant value U(x, y® 2°) 
along L. Therefore the curve determined by (16) must coincide with L in G,. 
It follows that the system of equations 


ARKD=C, OG y= (17) 


represents a complete integral of the system (15) in the domain G,, since by 
choosing the values o(x*, y®, 2°) and u(x®, »”, 2°) for the constants C, and C,, 
we obtain the ean curve of the system (15) passing through any point 
(x°, y°, =°) of G., i.e., we obtain any integral curve of the system. 

We now sadte the problem of findin ng the integral surface of equation 
(6) passing through the curve 


=r), y= 80), == 0), (18) 


where it 1s assumed that the functions «, 8, y are continuously differentiable 
and that the determinant 


a(z, B) 


b(s, i) 


never vanishes. Geometrically, the second condition means that the curve 
= 4(v), j= 8(v’) 


in the x}-plane is never langent to a characteristic curve of equation (6). 
Under these conditions, it follows from the basic theorems of Sec. 60 that in 
some neighborhood of the curve (18) there is one and only one integral 
surface of equation (6) passing through the curve (18). Since, as we have seen, 
this integral surface is made up of integral curves of the system (15), passing 


F cnhinemnaniaal 


* The analysis of the general system (14) requires # first integrals, and will be left to the 
reader. 
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through points of the curve (18), the parametric Bats for the required 
integral surface (with parameter v) can be found by substituting a(v), S(v) 
and +(v) for x°, y° and z® in the right-hand sides of the equations (16): 


ox, ¥, 2) = o[a(v), B(v), yv)] = PC), 
Yeas yy 2) = Yale), BO), ¢Co)] = FO). 


Remark. Every first integral 7(x, y, z) of the system (15) in the domain 
G, is a function of (x,y,z) and b(x, y, 2). In fact, by the definition of a 
first integral, 7(x, V7, 2) must anaue ¢ a constant value along every integral curve 
of (15). But, as shown above, such a curve is uniquely determined by the 

values which the functions (x, y, z) and (x, y, 2) take on the curve. 


Example. Find the integral surface of the equation 


Oz Le 7 
2—-+3—-+5=0 (6’) 
Ox oy | 
passing through the curve 
Y= Aye, ¥ = dal, Z = ql, 


where the constants dy, dz and a3 are such that 


a, 2 
am 'Q), (19) 
is 3 
The system (15) now takes the form 
dx dy dz iy 
an: ae a 
Integrating the equations 
dx dy 
2° 3 
and 
i. ke 
y=. 


3x — 2y, Sx + 22. 
Since 


a8) 

on 
A?) Px 
fad 

| 

Iw 
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the equalions 
ay 2} oe Ci; Sa se Dec 


a 
= 


represent 2 complete integral of (15’) in all of (x, 1. 2) space. For each pair of 
‘alues of the constants C, and C., the system (7°) determines a curve (a 
straight line) consisting of only one piece. Therefore the required integral 


of (6) is determined by the equations 


JX — 2y = Baye — Page, 


Dy +- Qo = Saye “ 2g. 


To eliminate c from these equations, we solve the first equation for u [this is 
possible because of the condition (19°)], and then substitute the resulting 
value of v into the second equation, 


Problem 1. Show that in general, Knowledge of & functionally independent 
first integrals of the system (4.4) allows us to reduce the number of unknown 
functions in the system by &. 

Problem 2. In physical problems, first integrals often express conservation 
laws. Use the law of conservation of total energy, to find a first integral of the 


system 
dx We ; 0 
cece ee Oe Nm -- OXY = U 
dt dt 


equivalent to the equation 


8 
(a 
H—> + bx = 0 
di- 


dr 


for free oscillations of a parucle in the absence of friction. 


62. Quasi-Linear Equations 


We now consider equations of the form 


. 
is Ou 20 
> A(X. fey Nay N) ames wt A(x), * @ @ » Ay it) — 0, (20) 
ae | Ox, 
where it js assumed that 
I) The functions a(x)... , Nis W) and b(x;,..., x,, u) have es 
lirst derivatives with respect to all their arguments on some domain 
PAN te. X,,.W) space: 


2) The ineg uality 


vs 


2 Gracie an Me NNO 
ty 


holds on G. 
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Starting from a known solution u(X;,...,%,) of (20) with continuous first 
derivatives, we form the auxiliary system of ordinary differential equations 


ux : a[X1, s 44 y Naas uCXy, ee ey x,)] * . 
Ts = a, Lo | eee | (21) 
F > ajlx,, ae 8 yg X ns u(x, se bs x,,)] 


where the parameter s cquals the arc length along the projection of the 
integral curve onto the plane uw = 0, Substituting u(x, ..., ¥,) into (20) and 
dividing both sides of the resulting identity by 


eee mee eee eee 
Jf Sates Be are | ery ke 


i=l 
we find that 
ae ee Ox; no : 
pac Gere meme orn > Dai hnavetiel) 
j=] j=l 
_ ou dx; a D(X. eee en (22) 
ae = i ———— LL 
dx, ds a 
F 2,404, ree ga Xny it) 
j=) 
du. D( Xivs cea tne) 0 
= =f — 
ds no 
| Sei oo Sas u) 


Seite et 
pi ON, 
245 
j=l 


can suill be represented as the derivative of v along some direction, but the 
direction now depends on was well as on X4,.-+5%n. Together (21) and (22) 
constitute a system of ordinary differential equations 


ds nm, 
Secrecy 
du b(Xy, ee Xn) 


a. Log AN RE SR ERR ST ; 


ds 


" 
Ihe tae dhl) 
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whose integral curves 
< } @ . 
xX. = XAS, Ee se eg Nano if ); == ie he es NH, 
a Pa 
ese US Nye ees Nye We) 


are called characteristic curves of equation (20).! 

The statement of the Cauchy problem for equation (20) is the same as 
for equation (1): Find @ solution of equation (20) taking given values on some 
(1 — 1)-diniensional surface S in (X,,.....N,) space, or more generaily, find 
an n-dimensional integral surface T of equation (20) passing through a given 
(1 — 1)-dimensional surface S in (xy,... Xn.) space." First we prove the 
uniqueness of such a solution, assuming that the solution is continuously 
differentiable and that the surface S has no tangent lines whose projections 
onto the plane w = 0 have the direction cosines specified by (21).'* Since the 
initial values (fors = 0) of the x,and ware known at every point (xq, ..., X49.) 
of the surface S$, the system (23) uniquely determines the x, and vas functions 
of s, i.e., given any sufficiently small piece of S, uw is uniquely determined in 
the neighborhood of the projection of this piece onto the plane = 0 which 
is covered by the projections of the characteristic curves intersecting $. The 
“width’’ of this neighborhood never ‘‘shrinks to zero”’ since 


1) S$ has no tangents parallel to the u-axis;¥ 


2) By hypothesis, S has no tangent lines whose projections onto the plane 
u = Q have the direction cosines specified by (21). 


Thus any two integral surfaces passing through S coincide in some neighbor- 
hood of S, and the uniqueness is proved. 

To prove the existence of a solution of the Cauchy problem for equation 
(20), we consider ihe case of two independent variables x and y, as in Secs. 60 
and 61, denoting the unknown function by z = z(x,y). Then equations (20) 


Since semilinear equations are a special case of quasi-linear equations, there are two 
concepts of a characteristic curve for semilinear equations, one in the sense of Sec. 60, the 
other in the present sense. It is easily verified that the characteristic curves in the first 
sense are the projections of the characteristic curves in the second sense onto the plane 
i= 0, 

'' The surface T must represent a single-valued function u(x, ....- v,) satisfying equa- 
tion (20) sufficiently near any of iis points, but 7 may still be a multiple-valued function of 
Myre s Xn “in the large.’ For example, suppose 7 is that part of a helicoid with axis along 
the u-aXis which lies outside some neighborhood of the axis. 

* The direction cosines of the tangent lines to S are found by substituting the point of 
tangency into the right-hand sides of (21). 

. This follows from the fact that the derivatives @u/dx, are assumed to exist on 7, and 
existence implies finiteness (cf. p. 3). 
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and (23) become 


ras Oz : ; 
a(x, y, 2) ae -+ D(x, y, 2) 5, -e(x,y,z)=0 (a+ 5*>0), (24) 
x y 


dx a(x, ¥, 2) dy DG, ye dz c(x, , Z) é 
NG oe cacy SS — = ae (25) 
ds Ja® -- Db ds Ja? ar + +. p* dls va" a 


while the characteristic curves and the curve § are given by 


ne als, x9, y, zo), y= Ls 5, x, y, >) z= 4(s, x", y?, 2°) (26) 
and 


x9 = o(v), y° = A(v), 2° = y(v), (2° + BF 4 4% > 0), (27) 


where v is a parameter. We shall assume that all given functions are con- 
tinuously differentiabie and that the curve S$ lies in the domain of definition 
of the coefiicients a, b, c. Moreover, if we are interested in finding a single- 
valued solution rather than constructing an integral surface, it must also 
be assumed that the projection S of the curve S onto the plane wv == 0 has no 
multiple points. In this case, the two italicized conditions take the form 


fy t ver 


gt Be > 0, ¢'b — 8'a = O along S. (28) 


We now verily that equations (26) and (27) determine a surface T in some 
three-dimensional neighborhood of the curve S such that 
1) T has an equation of the form z = 2(x,¥) near each of its points; 
2) The function z(x, }‘) is continuously differentiable. 
Then it follows at once from (25), read in the reverse order, that 2(x, ¥) 
satisfies (24), and it is obvious that the initial conditions are satusfied. By the 
implicit function theorem, it is enough to show that 
Ax, ) 
As, v) 
on S$, since then we can solve the first two equations (26) for s and c ina 
neighborhood of — afterwards substitute the results into the third 


equation.!? But on S, i.e., for s = 0, we have x = «(v). y == 2(x), and hence 
Om Ov 
oa a bo re 
d(x, y) ds as | ee 0 
ee ee = i. a” “- h* a —- b” = TM 
ee N v reer 
As, v) On ety . Va Se bh 
Seo. ee ¢'(v) Q'(r) 
GU OU ax 


MRS 


“The functions 9, & and 7 are continuously differentiable because of the continuous 
dependence of the solution of the system (25) on the initial data. 
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because of (25) and (28). This proves the existence of a solution of the 
Cauchy problem. 

It should be emphasized that the existence of a solution is guaranteed 
only in a sufficiently small neighborhood of the curve S, which is not known 
in advance. The fact that this is an essential feature of the problem will be 
made clear in the next section. 


63. Generalized Solutions of Linear and Quasi-Linear Equations 


In Sec. 60 we constructed a solution of the Cauchy problem for a first- 
order linear partial differential equation, and then proved the uniqueness of 
the solution, assuming sufficient smoathness of the surface S, the function f 
defined on S and all the coefficients of the equation. Examples like the one at 
the end of Sec. 60 show that a solution of the problem may fail to exist if 
these assumptions are weakened. Thus, in many physical problems, it ts 
important to extend the concept of a solution of the Cauchy problem in 
such a fashion that a solution will exist and be unique, under weaker con- 
ditions on the smoothness of f and the coefficients of the equation. We now 
describe one possible way in which this can be done.'® 

For simplicity, we consider the Cauchy problem for the case of two 
independent variables ¢ and x, and a linear equation of the form 


Ou , ou 
E(u) =— + a(t, x) ae + b(t, x)u + c(t, x) = 0, (29) 
Ot x 


where a, 6 and c are continuously differentiable functions of ¢ and x. The 
initial data will be specified on a segment of the line ¢ = 0: 


u(O, x) = f(x). (30) 


Clearly, the line ¢ = 0 is not tangent to any characteristic curves of equation 
(29). If a function u(t, x) satisfies (29) on some domain G, then obviously 


[| L(u)F dt dx = 0. (31) 


where F is any continuously differentiable function vanishing in a neighbor- 
hood of the boundary of G. Conversely, if u(x, 1) is continuously differentiable 


. ** Concerning generalized solutions of linear and quasi-linear first-order partial 
differential equations, see eg, R. Courant and D. Hilbert, Methods of Mathematical 
Physics, Vol. 2, Interscience Publishers, New York (1962), Chap. 2, Appendix 2 and Chap. 
5, Sec. 9; O. A. Oleinik, Discontinuous solutions of nonlinear differential equations (in 
Russian), Uspekhi Mat. Nauk, 12, no. 3, 3 (1957): I. G. Petrovski, Lectures on Partial 
Differential Equations (translated by A. Shenitzer), Interscience Publishers, New York 
(1954), Chap. 2, Sec, 9. 
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on G and if equation (31) holds for an arbitrary function of the above Lype, 
then L(z) = 0, i.e., u(t, x) is a solution of equation (29) on G. In fact, if 
L(w) = Oat a point (1°, x”), then L(u) docs not change sign in some neighbor- 
hood Q of (79, x°). But this contradicts (31) if we choose F to be positive in 
Q and zero outside Q., 

We now write equation (31) in another form, using the identity 


= Bes kt tee a A) 
L(u) Pf = DFP )u + cF + = (uF) + a (auf), (32) 
ar Ox 
where, by definition, 
oo (| an: ae 
L*(f) = — — — —(aF) -+- bF. 
(F) 7 5, | ) 


Integrating (32) over G, using Green's theorem to transform the integral of 
the last two terms, and bearing in mind that / vanishes in a neighborhood of 
the boundary of G, we find that 


(|. Liu alassS (|. [L*(F)u + cf] dt dx. 
The equation 
I, [L¥(Pyu + oF] di dx = 9 (33) 


will serve as the starting point for our definition of a generalized solution of 
equation (29). It is important to note that (33) makes sense for nondifferentiable 
functions u(t, x), and even for discontinuous u(r, x). Here it will be assumed 
that discontinuities of u(t, x) and its first derivatives u, = du/ér and vu, = 
éu/Ox can occur only on a finite number of curves x = (7), sharing no 
points except possibly end points, where 9(¢) Is continuously differentiable. 
It will also be assumed that u[r. o(2) — e] and u[¢, e(7) -+ <] approach uniform 
limits as ¢ - 0-+ on any arc of the indicated curves, and similarly for u, and 
w,. Such functions u(r, x) will be called piecewise continuous. 


DEFINITION. A piecewise continuous function u(t, x) is said to be a 
generalized solution of equation (29) on the domain G if equation (33) 
holds for every continuously differentiable function F(t, x) equal ta zero 
ina neighborhood of the boundary of G. 


It is easy to see that a generalized solution u(s, x) of equation (29) can 
have discontinuities only along characteristic curves. In fact, suppose ¥ = a(t) 
Is a curve of discontinuity of u(t,x) and let FCs x) be any continuously 
differentiable function equal to zero outside a sufficiently small neighborhood 
Q of the curve x = a(t), We can assume that u(t, X) 1s continuously differ- 
entiable on each of the domains Q, and Q, into which the curve v= a(t) 
divides Q, and hence u(r, x) satisfies (29) on 2, and Q,. Using (33), we find 
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that 
O=|] [LAE + cF]drdy 


~~ 


Pe 
a: * + 


= ll [L*(F)u +- eF] dt dx +|| 


5 LE(P)u + cF] dt dx 


* 


* ; deg 
=| gl ~ att, 0] F at 


if 
where the last two integrals are taken along the curve x = 9(f), and 
[u] = ult, o() — 0) — eft, (rt) + 0). 


Since F is an arbitrary function on the curve x = 9(¢), vanishing of the last 
integral implies 

Bos a = 0, 

dt 
1.e., X = o(f) is a characteristic curve of equation (29). In the same way, the 
derivatives u, and uw, can have discontinuities only along characteristic curves. 
In fact, suppose x(f, X) is continuous along a curve x = (1) which is not a 
characteristic curve, and let Q, and 9, be the same as before. Then by 
Theorents 1 and 2 of Sec. 60, u(#, x) coincides both in Q, and . (the overbar 
denotes the closure) with the unique ordinary continuously differentiable 
solution, equal to ul, o(f)] on the curve x = o(r). It follows that u, and x, 
can have no discontinuities on x = 9(f). Moreover, it can easily be verified 
that any piecewise continuous function (rf, x) is a generalized solution if it 
satisfies equation (29) at points of continuous differentiability and has 
discontinuities only on characteristic curves. 

By a generalized solution of the Cauchy problem for equation (29) and the 
initial condition (30) on the domain G, we mean a generalized solution 
u(t, x) of equation (29) which coincides with the given. function /(x) at 
continuity points. It is easy to see that such a function satisfies the equation 

Ne [L*(F)u + eF] di dx —|" FO, x) f(a) dx, 
where G, is the intersection of G with the half-plane ¢ > 0, and the segment 
[@), ay] is the intersection of the closure of G with the line ¢ = 0. This 
equation can be taken as the definition of the generalized solution of the 
Cauchy problem for equation (29) and the initial condition (30) on the 
domain G. 

An important property of linear partial differential equations, proved in 
sec. 60, is the fact that the neighborhood Ry of the surface S, in which a 
solution of the Cauchy problem is guaranteed to exist, does not depend on the 


. 


function f specified on S. It can be shown that a generalized solution of the 
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Cauchy problem for equation (29) and the initial condition (30) exists in this 


f(x) is continuously differentiable everywhere except at a finite number of 
points where it and its derivatives have discontinuities of the first kind. This 
fact can be proved by the same method used in Sec. 60 to prove the existence 
and uniqueness of a solution of the Cauchy problem. 

Next we consider quasi-linear equations of the first order. In this case, 
examples show that unlike linear equations, the domain of definition of 
solutions of the Cauchy problem depends on the function fand its derivatives. 
For example, for the equation 


Ou Cu 
— -+- a 0, (34) 
Ot OX 


the solution of the Cauchy problem satisfying the condition 


- 


. x 
u(Q, x) = —tanh-, <> 0 


~- 
“ 
— 


is defined only for ¢ < ¢, since it is not hard to show that the projections onto 
the ¢x-plane of the characteristic curves corresponding to the initial function 
—tanh (x/s) intersect for ¢ > = and “‘carry different values of uw to the points 
of intersection.” 

By analogy with linear equations, we now introduce the concept of 
generalized solutions of first-order quasi-linear equations, confining our 
discussion to equations of the form!® 

CL CLCO SSO er eet (35) 
ot Ox 
By a generalized solution of equation (35) on a domain G we mean a piecewise 
smooth function u(t, x) such that the equation 


iP Ee et oF a(t, x, u) — b(t, x, OF | drdx =0Q (36) 
con Ol Ox 

holds for every continuously differentiable function F(t, x) equal to zero in 
a neighborhood of the boundary of G. The generalized solution of equation 
(35) satisfying the initial condition (30) at continuity points is called the 
generalized solution of the Cauchy problem (35), (30). Just as for equation 
(29), it can be shown that the generalized solution u(r, x) of equation (35) 
satisfies the condition 


do(t) — aft, x + 0, u(t. x -+ 0)] — aft, x — Out, x — 9)] 
dt u(t, x + O) — u(t. x — 0) 


© Because of their physical significance, equations of the form (35) are sometumes 
called conservation laws, The concept of generalized solutions of quasi-linear equations 
plays an important role in problems of gas dynamics (see the references cited in footnote 15). 
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on the curves of discontinuity x = a(t). It can also be shown that in general 
» curve of discontinuity of u(r, x) is not the projection of a characteristic 
curve of equation (35). 

Unlike the linear case, equation (36) does not uniquely determine a 
generalized solution of equation (35) satisfying the initial condition (30), 
For example, the generalized solution of the Cauchy problem for the equation 


ee (34') 


satisfying the initial condition (30) where 


1 i 0 


ay pe, eee Okth 
iO) = (38) 
is given by the following function u(t, x), defined for every « > I: 
, l—« 
I if x< t, 
2 
oe Lome 
—%, if ; Pet. 
u(t, x) = (39) 
a— | | 
a, i. Us = t, 
z 
vy SD 
— | if t<x 
2 


The projections of the characteristic curves of u,(7, x) onto the rx-plane are 
shown in Figure 44. Thus, to uniquely determine the generalized solution of 
the Cauchy problem for equation (35) on 
a domain lying in the half-plane 1 > Q, 
subject to the initial condition (30), (38), 
we need an extra condition, such as the 
requirement that the inequality 


u(t, x — 0) > u(t, x + 0) (40) 


hold on the curves of discontinuity, in 
the case where @%a/du is nonnegative." 
It is easy to see that (40) singles 
out one of the generalized solutions 
Ficure 44 (39), ic., the solution 14(!, 4), which, 


For the proof, see O. A. Oleinik, loc. cit, 


SEC. 63 FIRST-ORDER PARTIAL DIFFERENTIAL EQUATIONS 213 


according to (39), is just 


l if x < 0, 
iy (1, x) mes f 0 
mae tox > U. 


Remark I, In the case of quasi-linear equations in several independent 
variables, conditions guaranteeing the uniqueness of a generalized solution 
of the Cauchy problem (for some large class of equations) have not yet been 
found. 


Remark 2. Unlike the case of linear equations, a generalized solution of 
the Cauchy problem for the quasi-linear equation (35) cannot be constructed 
by using characteristic curves passing through the point (0, x, u(0, x)). For 
example, it is easy to sec that the generalized solution of the Cauchy problem 
for equation (34) with the initial condition (38) is not uniquely determined by 
characteristic curves even in an arbitrarily small neighborhood of the line 
f = Q. In fact, suppose characteristic curves 
of equation (34), i.e., solutions of the system 


of equations QREEK. 
epee 


du dx 


dt dt x 

5 tee aes sec ece 
are drawn in (¢, x, 4) space through the SK 
points (0, x, u(0,.x)), where -—o <x < Yn 
-- co, In the case of a smooth function, the } | 
characteristic curves make up a_ surface | 
corresponding to a solution of the Cauchy 
problem. However, if u(0, x) = f(x), where 
J (x) is given by (38), then all points of the 
‘x-plane lying between the lines x — ¢ = 0 FIGURE 45 
and x-+-¢=0 are covered twice (with 
different values of uw) by the projections of the characteristic curves onto the 
ix-plane if ¢ > 0, while the points between the same lines are not covered at 
all if ¢ << 0 (see Figure 45). 


=e | i 


Remark 3. There are other methods of constructing generalized solutions 
of quasi-linear equations. One of these, called the method of “small 
viscosity’” because of its meaning in problems of gas dynamics, ts of particular 
importance and is described in detail in Oleinik’s paper (/oc. cit.). In fact, 
the whole class of problems involving construction of generalized solutions 
of quasi-linear equations, continuation of the solutions for all 4 > 0 and 
conditions guaranteeing their uniqueness is of great importance in gas 
dynamics and other branches of mechanics. In gas dynamics, the curves of 
discontinuity of generalized solutions correspond to shock waves. Then 
conditions of the form (37) express the conservation of mass, energy and 
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momentunt in passing through a shock wave, while (40) EXpre esses the fact 


that the entropy can only increase 


Problem i : rs fine a general lized solution of the Cauchy problem. for linear 

partial differential equations of the first order with any number of independent 

variables. - reer that a initial function f(x) is piecewise continuous, prove 

the appropriate € exit and uniqueness theorem, and also derive the relations 
s of di ieeoniinuty of the generalized solutions. 


Problem 2. Use equation (33) to define a con pas generalized solution of 


he Cauchy problem for equation (12) of Sec ev 0, assuming that the initial 
function f(x) is continuous. Prove the existence @ — uniqueness of this gener- 


alized solution. 


Problem 3. Construct a gen ralized solution of the Cauchy problem for 
equation (34) in the domain f > @ satisfying the condition (40), assuming that 
f(x) is a piecewise constant fu netion with finitely many discontinuities [first 


in 


consider the case where f(x) has one or (Wo discontinuities]. Prove the uniqueness 


Problem 4. Prove that the condition (40) determines 2 unique ¢ eeneralized 
solution of the Cauchy problem for equation (3+) in the half-plane > 0. 


64. Nonlinear Equations 


t 


We now consider nonlinear equations of the form 


| Gu Gu 
F(xqee es Xa oe] == (); (41) 
\ GX, OX no 


e function F has continuous deriv atives up to order 2 (incl Jusive) 
with respect to all its arguments in some (27 — 1)- dimensional domain 


2) The inequality 


n ya 2 
a OF 7 (42 
r=} LO(ONjONX,) 
holds on G. 

For brevity, we introduce the notation 
GF | CF Gu OF ; 
~ aes IN ya 2 ca = = Diy ee P;. 
OX, ou ax, GP, 


' 


The Cauchy problem can be set up just as in Sec. 64. We begin by proving 
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the uniqueness of the solution of the Cauchy problem, under certain con- 
ditions, at the same time developing tools which will be used to prove the 
existence of a solution. 

Suppose u(x,,...,%,) is a solution of equation (41) with continuous 
second derivatives. Substituting this solution into (41), we differentiate the 
resulting identity with respect to each variable x,, obtaining the equations 


= OD, ; , Op 
> P, a Xs + Ups = > Pps + X; + Up, =0, (43) 


ter : es 
1 j t 


ipa 
QU 
“4 


which are quasi-linear in the p; Suppose we construct the integral’ curves in 
(X;,...,%,) space of the system 


dx, a 

Faas ce a (44) 
pag 
j=l 


where the solution u(x,,...,X,) and its derivatives are substituted for x and 
p; in the right-hand sides. Then equation (43) can be written in the form 


dp; X, + Up, 
org ie Sk hs (45) 
ds Das 
2; 
A j=l 


Next we form the derivative of u(x,,...,.2%,) along the direction s defined 
by (44): 


du gs Oudx; _ s P; p; 


ds i10x, ds ist Is p? 
N jet 


The system consisting of equations (44), (45) and (46) uniquely defines x,, p; 
and uw as functions of s, once their initial values are specified. The integral 
curves of this system in (X,, 0... Xy5 W6 Py.---s P,,) Space are called character- 
istic curves of equation (41), and depend only on this equation.’ 

Now let S be the surface on which the initial values of uv are specified, 
and suppose S has a continuously turning tangent plane. If Sis given by the 
equations 

Der | ee one? se Pree E 


then the function wv can also be regarded as a function of ry,..., €,., on S. 


'‘S The projections of these curves onto the plane (4),.....%.) or onto the plane 
(xy... ., Xy, dé) are also sometimes called characteristic curves, 
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We shall assume that 


1) The functions xv, .-- 


,U_—-1) and ua, . 


SUPPLEMENT 


265 Unix) are continuous to- 


gether with their partial derivatives up to order 2; 


2) At least one minor of order un — \ of the matrix 


ex, OXn 
| dv, Cv, | 
i ax OX» 
OU, Ov,—1 | 


is nonzero at every point of S. 


It is easy to see that the initial conditions determine the values of p; on S, 
although admittedly not uniquely. In fact, the formula uv = u(x,..., x,) 
becomes an identity if we replace u and all the x, by their expressions in terms 
of vy,....0,-1, and then differentiation of this identity gives 
" te 7 
Apa 4 jel,...,a-—1 (47) 
i=1 Gu, dv, 
(where, of course, the p, are evaluated on S). On the other hand, (41) must 
hold on S, 1.e., 

FR 8 hate Dine tes pay (48) 
if we replace x,,..., X,, « by their expressions in terms of v,, ..., U,_y. To- 
gether (47) and (48) form a system of » equations in the unknown functions 
Pu -+++P, of the arguments ty,..., U,_,. The implicit function theorem can 
be applied to this system, provided that 


Py Pa: Sen 


ut 
Ox; OXe Ox, 
GX, OX: OX, 

| dv,_) Gv. Ov, y 


for all the relevant values of v,,..., U,-1, Le., everywhere on S. Therefore, 
from now on we shall assume not only that u(v,,..., 0,-1) is specified on Dj 
but also that values of Pir ++ +5 Px Satisfying (47)-(49) have been chosen on 5, 
where, according to the implicit function theorem, Dig hse pp Are continuous 
and have continuous derivatives with RESPEC (Otic cg ecg page 

It should be noted that in general, because of the nonlinearity of equation 
(48), the values of p; on S fora given ware not uniquely determined. However, 
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if all the above conditions are satisfied, and if 
CU) poax. 2. 2:12) . m 
P; (A) = P; (A), i a L, ae 2 gy tl 


at any point A ofS, then it follows from the implicit function theorem and the 
condition (49) that 

py” = py, P= ],....M. 
Geometrically, the condition (49) means that at every point (x°,..., x°) of S, 
the projection onto the plane (x,,..., x,) of the characteristic curve passing 
through the point (x7, ..., 42, 4°, pt, .... ph), where u°, p®,..., p? are deter- 
mined from the initial conditions, is not tangent to S (verify this). 

The uniqueness of the solution of the system (44)-(46) now implies the 
uniqueness of the solution of the Cauchy problem. [n fact, since F is assumed 
to have continuous derivatives up to order 2 with respect to all its arguments, 
it follows from (42) that the right-hand sides of (44)-(46) have continuous 
first derivatives with respect to all their arguments. and this guarantees the 
uniqueness of the solution of the system (44)-(46). 

Next we turn to the proof of the existence of a solution of the Cauchy 
problem in some neighborhood of the surface S, assuming that S and the 
function uw specified on S have the two properties In italics on p. 216, and 
moreover that values of p,,...,p,, satisfying (47)-(49) have been chosen 
on S. Just as in Secs. 60-62, we confine ourselves to the case of two inde- 
pendent variables. Then equation (41) takes the form 


PCS pg) 0: (50) 
where 
wee: ade 
Oe: i oy 


while the system of equations for the characteristic curves [in the five- 
dimensional space of points (x, y, 2, p. g)] becomes 


ds VP?+0° ds <P" +Q 
eet EI ip ee ey (52) 
ds JP? + O° ds VP" + QO" 


dz pP + gO 
ds / PP - OQ" . 
where 
OF OF . OF | OF 0 oF 


yo OS yo, Z=—, P=>—, Q==—. 
Ox Gy dz Gp dq 
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The initial conditions are determined by the relations 
yes x0); y <= y(t). . = z(t), P — ptr), q = q(v), 


where the first three functions are twice continuously differentiable and 


(=). (“> 0 


} al phe ‘ d De 


while the last two functions are continuously differentiable, and satisfy 
equation (50) and the conditions 


dx dy dz — 
ee, (54) 
dv dude | 
| ee ©: 
dx dy =), (55) 
duo dv 


To prove the existence of a solution under these conditions, we need only 
verify that the soluuons’* 


xv), (5,0), Ae, pr, 9&2) (56) 


constructed from the system (51)-(53) and the initial data satisfies the 
following conditions: 


1) The system of equations x = x(s.v), Y= ¥(s, v) can be uniquely 
solved for s and ¢ in some neighborhood 4” of the curve S, and the 
solution has continuous derivatives with respect to x and y. Then s 
and t can be chosen as curvilinear coordinates in 4”, and if we replace 
sand v in z(s, v) by their expressions in terms of x and y, we obtain z 
as a continuously differentiable function of x and 3.” 


2) The functions (56) satisfy equation (50) everywhere in 1". 


a 
e 


* 
# 


) The identities 


p= =, g = uz (57) 
Gx Gy 


ae 


hold everywhere in 1” 


** It is assumed that s = Oon S, 
| ** By hypothesis, the functions x(z), ..., g(v) specified on S have continuous derivatives 
with respect to v, and the right-hand sides of (51)-(53) have continuous derivatives with 
respect to all their arguments. Therefore the functions (56) solving the system (51)-(53) 
have continuous derivatives with respect to s and cv. ” 
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To prove condition 1, we need only show that the determinant 


| ds as 
(58 
-— (58) 
Ov dav 


is nonzero for all sufficiently small [s|. Since the clements of (58) are con- 
tinuous in s and v, it is enough to show that (58) is nonzcro on S itself. But 
this follows from (55) and (51). 

As for condition 2, it obviously holds on the curve S, since the initial 
values of p and g have been chosen to satisfy (50) on S. To prove that (50) 
holds not only on the curve S (i.e., for s = 0), but also for all sufficiently 
small |s|. we show that substitution of the solution (56) of the system (51)-(53) 
for the quantities x, }', Zz, p, g appearing in the left-hand side of (50) leads to an 
expression which does not depend ons. In fact, 

Ca aye a7 ea. 

ds ds ds ds ds 
and replacing dx/ds, ..., dg/ds by the right-hand sides of (51)-(53), we find 
that dF/ds equals zero. 

To verify condition 3, we proceed indirectly, first proving that the two 
equations 


ak Ox oy ae 
Gv av ov 

Oz ax Oy 
ee es ee: (60) 
ds c Os : Os 


Since, according to (51), 
OX P Oy ) 
as VP?+0° ds P+? 
equation (60) coincides with (53). As for equation (59), so far we only know 
that it holds for s = 0, since the initial values of p and q on the curve S have 
been chosen to satisfy (54). To prove that (59) is satisfied for nonzero S, 
we write 
uy Oz Ox oy 
tS | 
du dv ou 
and then calculate dU/ds. The differentiation with respect to s is easily justified. 
In fact, as already noted, the solution (56) of the system (31)-(53) has con- 
tinuous derivatives with respect to s and v, Therefore, ul (56) is substituted 
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into (S1)-(33), the right-hand sides of the resulting identities will have 
continuous derivatives with respect to s and e, and hence so will the left-hand 
sides, i.e., the derivatives 


ds Ov. Oscu as Av 


pe oO" O7: 
ON ‘u < 


dU CG": Gp cx dq CV p o"x o*y (61) 
ee PT Sy 61). 
ds @vés @s@v Gs dv du ds dv as 


We now differentiate the identity 


Gz Gx 0} 
<= _ jp —q==0 
Gs Gs Cs 


(valid everywhere in of” by the argument just given) with respect to. v, 
obtaining 


a9 } 0 an? 
6°: = dpox  aqay O-x a : 
ee er = 0. (62) 
Gsce Gv ds dtu os Os Ov Gs Ov 


dU épéx @aqdy apex  aqay 


Se amnataadl 
‘tices mmmeenpni, calm ewer MMe, ee SI ream | SRNR 


ds Ge Gs Gu ds os du as av 


or 
dU @p P _ ag O Ox X+pZ , oy Y+rqZ (63) 
Se er eee oe” Shag ane che ee ea me oe 
ds @vVP?>+Q* @evP*+Q* avVP°+Q° dvvP"+Q 
after using (51) and (52). Moreover, differentiating the identity 
FOE YG) ==) 
with respect to cv, we obtain 
SOM xe ON « whl oo oROP 06 4 
xi 2y 2g ap eo =o. (64) 
Gv du du dv dv 


Dividing both sides of (64) by VP? + Q® and subtracting the result from (63), 
we find that 


dU Zz dz Ox 0y\ ZU 
— See fe aaa ——— 3 
ds V PY + OF \au av dv PP ae 


and hence 


5 iis oat Oa 


Since U(0) = 0, it follows that U(s) = 0 for the other values of s. 
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Thus, finally, we have proved that the identities 


Oz Ox oy Oz Ox dy 
— = p—+4¢ —=p-—+gs 5 
oy Oe ape a ag Pe oe om 


hold in the whole neighborhood ./. Noting that 


dz _ dz dv , dz ds dz _ 92 dv , dz Os (66) 
dx vax ' asdx’ dy advady | dsay’ 


and substituting (65) into (66), we obtain 
2 (> Ox 22) Ov | ( Ox ay) 2 Ox , ay 


ax \P av ao ax Bos ta a oe oe 
and similarly 
az _ 
dy ~ 


Therefore condition 3, p. 218 is satisfied, and the proof of the existence of a 
solution of the Cauchy problem is complete. 


Remark. Instead of using s, the projection of the length of the character- 
istic curve onto the plane (x,,...,*%,) as the parameter, we can introduce 
a new parameter 7, related to s by the formula 


ds = |S dt. 


J2=1 


Then the system (44)-(46) can be written in the form 


and so on. 


Example. Find the solution of the equation 
(2) + ! Ge 1=0 (67) 
Ox Oy 


passing through the circle 
+ y? = I, rae 0 (68) 
Introducing a parameter v, we write (68) in the form 
x=cosv, posiney, 2=0. (69) 
In this case, equations (51)-(53) become 


Oe OG, (70) 
2p 2q WAp*+q*) O 0 
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and the last two equations imply p = © Cy, g = Cy, where C, and Cy are 
constants. Substituting these values into the first three equations, we obtain 


1 * x co NS 1 co ak sw, pd | 
y=lCy eC, ys ly C, 2 = AC + Cot + 
where Cs, C, and C, are constants. To satisfy (67), we must have 


Ci+Cy= 1, (71) 
and hence 
z= 2t+ C;. 
Moreover, 
C,= cost, C,=sinr, C, = 0, 
since the curve 


— pss 2 
x=2Ct+Cy, p= WCyt + Cy 2 = 2(Cy + Cytt+ Cs 


must go through the point determined by the parameter v on the circle (69). 
Therefore the ‘integral surface of equation (67) passing through the circle 
(69) is given by 


=2Cr+cost, p=2C t+ sinv, z= 21, (72) 


where ¢ and v are parameters. 
To sausfy the condition 


Gz Ox y 
_ = Poors S 
Gt Cv At 


for ¢ = Q, it Is necessary that 
—psinvu+gcosv =0 
or 
C, sine = C, cos v, 
and hence 
C,=ecost, Cy=esine, wheree = -, 


because of (71). Continuity considerations show that < must have the same 


value along any characteristic curve, and hence the equations (72) take the 
form 


N= (Qte + l)cosv, y= (2te + l)sine, 2 = 20. 
Elimination of ¢ and v gives 
x? = y* = (1 = 2)*. (73) 
Thus we have finally found two integral curves of equation (67) passing 


through the circle (68). These are two eneuiat cones in (x, y', 2) space with the 
circle (68) or (69) as base and the z-axis as common axis. 


Remark, The above exampl ¢ shows that the requirement that the deter- 
minant (58) be nonzero only in the immediate vicinity of the curve S is an 
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essenuial feature of the problem, since 


d(x, \) 
d(t, v) 


anishes for f= -: 4. Suppose the projections onto (x. y, 2} space of the 
integral curves of the system (51)-(53) passing through the curve S (in the 
notation of Sec. 62) are given by 


y=xx5y"), y= y(s, x, py), z= 2(s, X°, yp’), (74) 


Then it must not be thought that the curves (74) form a smooth surface for 
arbitrarily large values of the parameter s (or of the equivalent parameter 1). 
For example, despite the fact that the integral curves of the system (70) are 
determined by the system for arbitrarily large ¢, the integral surfaces of 
equation (67) passing through the circle (69) cannot be continued indefinitely 
without eventually arriving at singular points, i.c., the vertices of the cones 
(73). Singularities of this type cannot occur in the case of quasi-linear 
equations, since the characteristic curves of such equations do not intersect 
1 CXijy ees atl) Space. 


Problem 1, Clarity the relation between the characteristic curves introduced 
in this section and those of Secs. 60 and 62. 


Problem 2. A characteristic curve might be thought of as a curve in 
(X,,...,X,, &) space through each point of which passes a plane whose normal 
has direction numbers p;...., Pn, ~1. Prove that this plane is tangent to the 
curve itself at each point of the curve. 


Problem 3 (A. Haar). Let G de the pyramid in @y,...,.¥,) space (2 > 2) 
defined by the inequalities 
O< x, <a, max tly,l,...,l¥aul}<a@—-%x, (a> 0), 
and let Ad be the base of G. Suppose u(x,,..., X,) ts a differentiable function 
defined on G satisfying the inequality 
Ou eu 


OX» 


Ox 


me 


u-—l 
= 


ote Jee + 8 (6 - 0). 


et 


f=] 


Prove that then 


min {min uw, O| ef — &(eF — 1) uy, eee Mn 
( A | | | 
< max fmax w, O| ef + d(e72 — 1) 

| ow j 


everywhere in G. 
Hint. Consider the point where the function 


Bu Se i sea le = 1)] (2 > 0) 


takes its maximum. 
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Problem 4. Use Prob. 3 and Hadamard’s lemma to prove (under suitable 
conditions) that 
a) The solution of equation (41) is unique; 
b) The solution of (41) depends continuously on the initial data and the 
form of the equation. 
Hint. For simplicity, assume first that the initial conditions are specified 
for x, = 9. 


65. Pfaffian Equations 


An equation of the form 
Pdx+ Qdy+ Rdz =Q, (75) 


where P, Q and R are functions of x, y and z, is called a Pfaffian equation 
(in three dimensions). There are two approaches to the study of such 
equations. In the first approach, x, y and 2 are regarded as functions of a 
single parameter f. Then specifying two of the variables x, y and z as functions 
of 1, we arrive at an ordinary differential equation determining the third 
variable. We can also require x, y and z to obey an arbitrary relation of the 


form D(x, y, 2) = 0. (76) 


Then, regarding x, y and z in (76) as functions of a parameter ¢ and difleren- 
tiating with respect to ¢, we obtain 
ay aU here ay ey (77) 
Ox oy Oz 
Under very general assumptions concerning ®, P, Q and R, equations (75) 
and (77) can be solved for the ratios of any two differentials to the third, e.g., 
for dy/dx and dz/dx. This gives a system of two ordinary differential equations 
determining y and z as functions of x. In general, the condition (76) leaves 
one arbitrary constant in the general solution. 
In the second approach to Pfaffian equations, one of the variables x, y 
and z, say z, is regarded as a function of the other two. Suppose R #0 in 
the domain under consideration. Then (75) implies 


dz = P, dx +- Q, dy, (78) 
where 
P 
P, —_— eo R ‘ 0, = : r 
and hence ee 
ay = P,(x, ys Z); (78') 
Oz a 
ay = Q,(x, y, Zz). (78 ) 
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Suppose it is known in advance that z has continuous second derivatives with 
respect to x and y, while P; and Q, have continuous first derivatives with 


respect to all their arguments. Then 


OP a OP, oz om 0, 4 IQ, 92 
Oy  dzdy ax dz Ox 


or 
ais OP, _ Q, d2; » 
dy dz 0 ax oz 
since 


O72 2. 
o “ 


— 


Oxdy dyax- 


(79) 


In differentiating P,; and Q, with respect to x and y, we have taken into 
account not only the explicit dependence of P, and Q, on x and y, but also 
their implicit dependence on v and y via the variable z (which, by hypothesis, 
is a function of x and y). The condition (79) can also be written in the form 


00 4 (2 oP | & 22) 
p|—= —-—]} + — — —} + RE— — =} = 0. 80 
| dz oy g Ox Oz Oy ax 80) 


THEOREM. Suppose the condition (80), or equivalently (79), is 
satisfied identically on a three-dimensional domain G, where P, and Q, 
have continuous derivatives up to order 2 (inclusive) with respect to all 
their arguments. Then one and only one integral surface of equation (78), 
or equivalently of (75), passes through each point (x°, y°, 2°) of G. 


Proof. If we set y = y® in (78'), the resulting equation determines 
a unique integral curve L in the plane y = y® passing through the 
point (x°, y°, 2°). Similarly, equation (78") with x assigned a consiant 
value C determines a unique integral curve /g in the plane x = C 
passing through the point of L lying in this plane. Then the set of 
curves /,. constructed in this way for all points of L uniquely determines 
a surface S passing through the SOINE Ss 2°), If the surface S 
has equation z = z(x, )’), then z(x, y) has continuous derivatives up to 
order 2, because of the continuous dependence of the solution of (78") 
on the parameter C and on the initial data. | 

Next we show that S is actually an integral surface of (78). The fact 
that z(x, p) satisfies (78") is obvious from the construction of S, and 
similarly z(x, y) satisfies (78’) for y == ¥°. To show that 2(x, y) satisfies 
(78') for other values of y, we consider the function 

Gz 


Fox — — P,(x, 3, 2) 
ake 
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and evaluate dF/¢éy, obtaining 


oF 8 (22) _ 26 - OP, @z _@Q, , 0Q, 02 OP, OP, Az 


\oxn/ Gy ey @x G2 ax ay — az ay 


“pec 


a y CG ¥ 


gx =s Gz | @t ey Cs a 


where the formulas 
dz CP Oz 
21e) -2(&), = = Cy 
Oy\dx! = ax \ey: oy 


have been used. Substituting (72) into (74), we find that (74) reduces to 


OF 80s p 
oy Cz 


and hence 


a 
a Oey ee al 1)! exp oO, dy}. 
| Jy® @e 


Therefore F(x, y) vanishes for all y under consideration, since it vanishes 
for y = °. This completes the proof. 


Remark. Geometrically, the first way of solving Pfaffian equations 
corresponds to constructing curves orthogonal to the given three-dimensional 
direction field (with directions specified by the vector with components 
P, Q and R). The second approach corresponds to constructing surfaces 
orthogonal to the same direction field (or equivalently, surfaces with given 
tangent planes at every point of space). 


Problem, Consider the n-dimensional Pfaffian equation 
be 
: a(x, geen Xn) dx; = 0, (81) 
ro] 


with appropriate pte a concerning the coefficients a; In how many 
different ways can this equation be treated? Analyze in most detail the approach 
in which the “integral manifolds’ are ( — 1)-dimensional surfaces. 


Hint, It is convenient to write (81) as a system 


On 
oye DA iy isha kp ll); i Sk gue ath 
ore 


where u = x,, 
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